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Globalize Your Business 



Melissa Data can help you globalize your applications as you 
expand operations to other countries or reach new customers in 
emerging markets. As a world leading data quality vendor, we 
offer solutions to verify, correct and standardize addresses in 
over 240 countries. Eliminate returns, cut postage expenses, 
prevent fraud and keep your customers happy by verifying their 
address before you send a package. 

•Reduce address correction fees - save up to $10 per package 
•Efficiently validate and correct addresses every time you ship 
• Maintain high customer satisfaction 

Accurate data. Delivered. 


Address Verification 
ID Verification 
^ Email Verfication 
GeoCoding 
IP Location 
Name Parsing 
Phone Verification 
Record Matching 


www.MelissaData.com/global 
or call l- 800 -MELISSA (635-4772) 
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Your Partner in Data Quality 


Request a free trial of our Multiplatform APIs and Web Services for rapid application development. 









SpreadsheefGear 



Performance Spreadsheef Componenfs 


SpreadsheefGear 20 I 2 
Now Available 


WPF and Silverlight controls, 
multithreaded recalc, 64 new 
Excel compatible functions, save 
to XPS, improved efficiency and 
performance, Windows 8 support, 
Windows Server 2012 support, Visual Studio 
2012 support and more. 



Excel Reporfing for A5P.NET, 
WinForms, WPF and Silverlighf 


Easily create richly formatted 
Excel reports without Excel 
from any ASP.NET, Windows 
Forms, WPF or Silverlight 
application using spreadsheet 


technology built from the ground up for 
performance, scalability and reliability. 


Excel Compafible Windows Forms, 
WPF and Silverlighf Confrols 


Add powerful Excel 
compatible viewing, editing, 
formatting, calculating, 
filtering, charting, printing 
and more to your Windows 
Forms, WPF and Silverlight applications with 
the easy to use WorkbookView controls. 



Excel Dashboards, Calculafions, 
Charfing and More 


You and your users can 
design dashboards, reports, 
charts, and models in Excel 
or the SpreadsheetGear 
Workbook Designer rather 
than hard to learn developer tools and you 
can easily deploy them with one line of code. 


Free 
30 Day 
Trial 


Download our fully functional 30-Day evaluation and bring Excel Reporting, Excel 
compatible charting, Excel compatible calculations and much more to your ASP.NET, 
Windows Forms, WPF, Silverlight and other Microsoft .NET Framework solutions. 


www.5preadsheefGear.com 


X Visual Studio 


Partner 



SpreadsheefGear 

Toll Free USA (888) 774-3273 | Phone (913) 390-4797 | sales@spreadsheetgear.com 





















Competitive Upgrade Offer 

★ Get Essential Studio Enterprise Edition for $299 (Normally priced at $1,995) 

★ Includes 400+ controls across 8 platforms 

★ Fixed renewal pricing for 3 years 

★ No limit on the number of licenses that can be purchased 

★ Offer applies to anyone who owns licenses from major competitors 

★ Additional restrictions may apply 



Purchase now: 


syncfusion.com/sdtimescompoffer 


■■4 

SSSSyncfusion 









































Contents 


ISSUE 286 • FEBRUARY 2013 


FROM THE EDITORS 


SPECIAL REPORT 


9 Hurray! The enterprise is exciting again! 

9 Do you have Mono? 

10 Feedback 

NEWS 

14 Investors are all about the enterprise 

16 Application Developers Alliance stakes out ground 
in first year 

19 Apache Cassandra 1.2 packs in the nodes 

20 Mono, Microsoft build bridges 



26 Changes in form and behavior drive changes 
in development 

29 Divergent views of supplemental compute 

30 UrbanCode handles release management 

33 Devart updates dotConnect ADO.NET providers 

COLUMNS 

65 CODE WATCH by Larry O'Brien 

What would Map do? 

66 GUEST VIEW by Angus Telfer 

AMOP takes over middleware messaging 

69 ANALYST VIEW by Rob Enderle 

The Fall of Apple in Three Acts 

70 INDUSTRY WATCH by David Rubinstein 

Time to rethink the back end for mobile 


Bugs on the loose! 



page 37 


FEATURES 

IT and developers: 
Hand in hand for MDM 




In-memory data grids set for a big 2013 



i i -r-r-r-i 


page 61 


Software Development Times (ISSN 1528-1965) is published 12 times per year by BZ Media LLC, 7 High St., Ste. 407, Huntington, NY 11743. Periodicals postage paid at Huntington, NY, and additional offices. 
SD Times is a registered trademark of BZ Media LLC. All contents © 2013 BZ Media LLC. All rights reserved. The price of a one-year subscription is US$179 for subscribers in the U.S., $189 in Canada, $229 
elsewhere. POSTMASTER: Send address changes to SD Times, PO Box 2169, Skokie, IL 60076. SD Times subscriber services may be reached at sdtimes@halldata.com or by calling +1-847-763-9692. 

























Bugs are a fact of software. 
Find and fix them faster than ever. 
Happier users, happier developers. 


• Raygun makes error collecting, reporting 
and management automatic and easy. 

• Integration takes minutes and your errors 
are securely stored with the Raygun service. 

• You fix bugs faster. See overall health as well as 
dive deep to get the context to fix a bug fast. 

• Track multiple apps, include your whole team 
and leave no bug behind! 

Start your FREE trial 
at http://raygun.io 
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Systems in Focus 

Award winning modeling & design tools for Software, Business & Systems 



Model Driven Development 


ENTERPRISE 

'SJJ ARCHITECT 


Sparx Systems 

www.sparxsystems.com 


Collaborative modeling & design environment 

Analyze, design & build with exceptional quality & efficiency 

UML® based core, plus many standards based extensions 

BPMN™, SysML, SoaML™, BPEL™ & many others 

Simulation, coding, debugging & visualization tools 

Performance, price & precision 

Full lifecycle support with end-to-end traceability 


Join the community www.community.sparxsystems.com 
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Reuse MS Word documents 
or templates as your reporting 
templates. y 


Integrate dynamic 2D and 3D 
charting to your reports. 



Easy database connection with 
master-detail nested blocks. 



Powerful, programmable 
template designer with full 
sources for Visual Studio®. 



Create print-ready, digitally 
signed Adobe PDF and PDF/A 
documents. 


wuies, columns, images, 
headers and footers and more. 



Visual Studio 


Partner 

US+1 877-462-4772 
EU +49 421-4270671-0 
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OPINION 1 9 



Hurray! The enterprise is exciting again! 


I t has been 14 years since SD Times 
(and SDTimes.com) hit the streets, 
and since then, the Great Wheel of His¬ 
tory has rotated quite a few times. 

We saw Java come, go, and come 
back again. 

We watched the consumer Web take 
over the world through sites like Yahoo, 
Google and Facebook. 

We witnessed the demise of hun¬ 
dreds of companies and their hyped 
technologies, whether through disuse, 
competitive acquisition or irrelevance. 

Lately, we haven’t seen much public 
interest around enterprise IT; the focus 
has been on consumer tech. Indeed, 
other than a brief wave of excitement 
around enterprise service buses, it has 
been quite a while since the enterprise 
was as exciting as it is right now. 

In Silicon Valley and around the 
world, enterprise software is where 
investors have decided to place their 
money. After years of focusing on con¬ 
sumers in software, investors have 


awakened to the fact that enterprises 
are great customers with very deep 
pockets. And, perhaps more important¬ 
ly, developers have long known that 
enterprise software tends to be stale, 
stuffy and business-like. 

Tell that to Atlassian, AppDirect, 
GitHub and Splunk. With the NoSQL 
movement in full swing and the mobile 
revolution well behind us, the metaphor¬ 
ical enterprise has uncrossed his arms 
and come down off the mountain to see 
just what all this hubbub is about. In sec¬ 
onds, a decision-maker can access to data 
from all across the company through 
RESTful APIs that access huge clusters 
of computers, all working to solve prob¬ 
lems for pennies on the dollar. 

Despite this perfect storm of enter¬ 
prise-friendly technology, there are oth¬ 
er factors at play here. During the finan¬ 
cial meltdown and the ensuing two 
years of economic troubles, investors 
lined up to put their dough into con¬ 
sumer-focused startups, and to bring 


such companies to the public markets. 

Anyone who has checked stock tick¬ 
ers recently, however, can clearly see 
that those investments didn’t always 
pan out. With examples like the Face- 
book IPO, Zynga’s struggles and the 
colossal failure of mega-startup Color, 
investors have taken black eyes in soft¬ 
ware startups despite the seeming eco¬ 
nomic dominance of Silicon Valley. 

Not only is the enterprise ready to 
embrace all this new technology, 
investors may be ready to embrace 
those who would develop a company to 
sell to enterprises. And it certainly 
doesn’t hurt that it’s easier than ever to 
sell software and services to business 
customers: Salesforce.com and Amazon 
Web Services have both shown that 
self-service enterprise software is not 
only viable, it’s the right way to go. 

If you are working in enterprise soft¬ 
ware, 2013 is going to be a great year. We 
are excited to see such interest in making 
businesses run more smoothly. I 


Do you have 

M icrosoft-centric developers are 
everywhere, it seems. Yes, of 
course, many development shops focus 
on Eclipse, or build software in Java or 
C++, or focus exclusively on platforms 
like Linux or the Mac. However, many, 
many, many developers live in Visual 
Studio, write code using C# or Visual 
Basic, and target the .NET runtimes on 
Windows servers, the ASP.NET Web 
environment and SQL Server. 

Being an expert in Microsoft devel¬ 
opment is a wonderful career move, 
and is an asset to any Microsoft shop— 
until the developer is asked to target 
something that’s not a Microsoft plat¬ 
form. That’s where Mono comes in, 
offering a runtime for .NET code on 
Android, BSD, iOS, Linux, Mac OS X, 
Solaris and Unix, as well as PlayStation 


Mono? 

3, Wii and Xbox 360. Oh, Mono runs on 
Windows as well. 

The big question for Windows-cen¬ 
tric developers looking to target other 
platforms: What is Microsoft’s relation 
to Mono? After all, Mono is essentially 
reverse-engineered .NET. It’s one thing 
for enthusiasts, hobbyists or small shops 
to use, say, MonoTouch or MonoDroid 
for their mobile apps, or to port their 
ASP.NET code to run on Mono on an 
inexpensive Linux box. 

But bigger businesses, especially 
those who do a lot of business with 
Microsoft or who rely upon the whole 
Windows Server stack, may not wish to 
go that route without Microsoft’s bless¬ 
ing, or without assurance that Microsoft 
won’t penalize them or rapidly render 
their Mono investments invalid. 


The good news is that, after years of 
opposition to Mono, Microsoft appears 
to be changing its tune, or at least isn’t 
singing an anti-Mono song. As you’ll read 
in “Mono and .NET build up bridges” (p. 
20), Mono is in a gray area, where it takes 
advantage of the movement of some 
Microsoft libraries into the open-source 
community, as well as the Microsoft 
Community Promise. Managers within 
Microsoft have even expressed gratitude 
that Mono can bring .NET into environ¬ 
ments where Microsoft’s own develop¬ 
ment will never go. 

At present, Microsoft is leaving 
Mono alone. We believe that it’s not 
going to get better than this anytime 
soon; don’t hold your breath for official 
Microsoft support for Mono. Given 
that, there’s still a risk to base business- 
critical enterprise IT around Mono, but 
it is not a huge risk. If you are consider¬ 
ing Mono, we say go for it. I 
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FEEDBACK 


Time to decelerate 

I for one hope that we hit some solid 
speed LIMIT on the hardware side 
soon (regarding “The Trouble with Ger- 
rold: Using all the petaflops,” available 
at sdt.bz/37169); I want to live long 
enough to see mature, optimized code 
that has been through multiple gener¬ 
ations of evolution within the same 
system resource “box”! Remember 
Borland? Just imagine what software 
would look like after all these years if 
Kahn of Borland’s paradigm had pre¬ 
vailed over the Microsoft/Lotus 
crowd’s lawyers. If that day comes 
when the competition is about the ele¬ 
gance of the execution of the intent of 
the programmer within stable hard¬ 
ware performance limits, I think most 
software will have that Quicken/Side¬ 
kick feel of excellence. 

Guy DeWhitney, United States 

Our own 

programming paths 

It’s funny how our different views of 
the evolution of languages (based on 
“The Trouble with Gerrold: Self¬ 
validating code,” available at 
sdt.bz/37303) on micros is largely 
shaped by which platforms we used. At 
the time (the mid-1980s and onward), 
what we knew seemed like the entire 
universe, without much more than 
UUCP mailing lists and BBSes to talk 
to others in something resembling real 
time. Turbo Pascal was big on the Intel 
PCs at the time (which we used in my 
high school), but the Apple II world 
was very different and still significant 
at that time. And in universities, the 
Unix world, with its ubiquitous C, was 
just emerging (Sun 2s and IBM RTs 
and more at Carnegie Mellon Univer¬ 
sity), and the DEC world with VMS 
and its family of languages had its own 
niche of professional programmers, 
and so on. 

Hell, a year after I wrote a system in 


Turbo Pascal that ran the state math 
competition (Florida Mu Alpha Theta, 
using Scantrons and all that), I was in a 
job writing a medical-insurance billing 
system on PCs in a dialect of (shudder) 
COBOL. I wonder whether anybody 
has compiled a history that tries to 
trace the different cross-influences of 
the different microcomputer comput¬ 
ing environments, both among the 
home hobbyists and the research and 
education programmers at universi¬ 
ties, and the more staid environments 
for billing and accounting on VMS and 


IBM iron (and the odder environ¬ 
ments like Lisp-machine-like TI 
Explorer calculators, and so on). 

We all walked our own different ran¬ 
dom paths through those different 
forests. 

Todd Masco, Pocketdyne 


What do you think? 

Letters to SD Times should include the 
writer's name, company affiliation and 
contact information. Letters become the 
property of BZ Media and may be edited. 
Send to feedback@bzmedia.com. 
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► Putting technology on a canvas 

When you hear about geolocation data, you don't think about art. But the folks 
at Kaazing found a way to throw the two concepts together: "If art is a reflec¬ 
tion of life, then does that make technology a reflection of art? It did recently, 
when Kaazing—the folks behind the commercial distribution of the WebSocket 
high-performance transport protocol—teamed up with the Novela Art Festival 
in Toulouse, France to follow festival attendees on a 'canvas."' What did this 
collaboration mean? Read more at www.sdtimes.com/blog/2142. 

► Getting open source to say 'Hello' to the mainstream 

''Marketing'' usually means money and (perhaps) unsavory messaging. Open- 
source proponents have been reluctant to use it. But the time may have come 
to reconsider, says Chris Barylick: ''The open-source community has some 
incredible successes under its belt, now it just needs to ascend to the 
next level, make itself known, and become more widely known and readily 
considered when decisions are made.'' Do you agree? Read more at 
www.sdtimes.com/blog/2143. 
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SERVERS 




Save $360 


AMD Hexa-Core 


6 Cores x 2,8 GHz 
(3.3 GHz Turbo Core) 


Our data centers offer top security, 

Cisco firewall protection and maximum uptime. 

✓ Unlimited traffic with no extra cost 

✓ Parallels Plesk® Panel 11 for unlimited domains 

✓ Exclusive to 1&1: Optional SUSE Linux Enterprise Server 

✓ Mobile Server Monitoring for Android/iOS devices 


16 GB RAM DDR3 ECC 
1,000 GB (2 x 1,000 SATA) 
Software RAID 1 

Free choice of CentOS, Debian, 
Ubuntu, or openSUSE. 

Unlimited Traffic (100 Mbit/s) 


$ 99 


OFFER ENDS 


1&1 is celebrating its 25 th anniversary. 

Over the past 25 years 1&1 has grown to become 
one of the world's leading web hosts. Today, with 
12 million customer contracts and 5000 employees 
1&1 provides superior web hosting and server 
solutions to support your business. In celebration 
here is a gift from us to you. 


Server XL 6 



* Offer valid for a limited time only. First year save $360 off regular price of Dedicated Server XL 6. Other terms and conditions may apply. Visit www.landl .com for full promotional 
offer details. Program and pricing specifications and availability subject to change without notice. 1&1 and the 1&1 logo are trademarks of 1&1 Internet, all other trademarks are the 
property of their respective owners. © 2013 1&1 Internet. All rights reserved. 

Intel, the Intel logo, Xeon, and Xeon Inside are trademarks or registered trademarks of Intel Corporation in the U.S. and/or other countries. 
















Big Data gets real 
at Big Data TechCon! 

The HOW-TO conference for Big Data and IT Professionals 

Check out this list of classes! 


' Untangling the Relationship Hairball with 
a Graph Database 

• HBase Schema and Table Design Principles 

• A Survey of Probabilistic Data Structures 

• High-Speed Data Ingestion with Sharded 
NewSQL Databases 

• Analytics Maturity Model 

• How to Fit a Petabyte in Apache HBase 

• Apache Cassandra — A Deep Dive 

• Beyond Map/Reduce 

• How to Integrate Structured and Unstructured Data 
with Avro 

• Implementing a Real-Time Data Platform Using HBase 

• Big Data Science: Extracting Truth from Large, 

Multi-Structured Data, Parts I & II 

• In-Database Predictive Analytics 

• Building an Impenetrable ZooKeeper 

• Map/Reduce Tips and Tricks 

• Building Applications Using HBase 

• Mastering Sqoop for Data Transfer for Big Data 



Building High-Volume Web Applications 
Building Successful Data Science Teams 

• Extending Your Data Infrastructure with Hadoop 

• Microsoft’s Big Data Story 

• Data Modeling and Relational Analysis in a 
NoSQL World 

Oozie: A Workflow Scheduler for Hadoop 
• Distributed Search and Real Time Analytics, Parts I & II 

• Selecting the Right Big Data Tool for the Right lob, and 
Making It Work 

• Getting Started with Predictive Modeling: 

Simple Models and Basic Evaluation 

• Hands-on NoSQL for the DBA 

• Getting Started with R and Hadoop, Parts I & II 

• Seven Deadly Hadoop Misconfigurations 

• Hadoop Backup and Disaster Recovery 101 

...and LOTS more! 

Download the full Course Catalog TODAY at 
BigDataTechCon.com. 


Keynotes Announced! 



Oscar Boykin 

Staff Software Engineer 
at Twitter and co-author 
of Scalding 

Addition in the Large: 
Simple Counts and 
Not-So-Simple Counts 


9 Adam Laiacano 

Data Scientist and Engineer 

Improve, Don’t Abuse: 
Use Big Data for 
Your Customers, 
not Against Them 



Dr. Michael Stonebraker 

Database Pioneer and 
Adjunct Professor at MIT 

Big Data Is (at Least) 

Four Different Problems 


A BZ Media Event Big Data TechCon™ is a trademark of BZ Media LLC. 


1 Platinum Sponsor 

Gold Sponsor 

Silver Sponsors 

fjjf LexisNexis' 

MAPR 

Technologies 1 X. 

Cloudera 4^paradigm4 thjnk hid 1 

data-driven discovery 1 











Register Early and SAVE! 
Register by Feb. 22 and Save $300! 


Discover how to master Big Data from real-world 
practitioners - instructors who work in the trenches 
and can teach you from real-world experience! 



Basier Serge Dipti Jeffrey Jaroslav Ben Jean-Daniel John A. Max Jim 

Aziz Blazhievsky Borkar Breen Cecho Coverston Cryans De Goes De Marzi Duey 



Daniel David Mark David Amandeep Lynn Dan Simon Michael Andrew C. 

Eklund Gleich Grover Hwang Khurana Langit Mallinger Metson Miller Oliver 



Claudia Jason Eddie Jonathan Ryan Kathleen Ravi Dean Andrew 

Perlich Rutherglen Satterly Seidman Tabora Ting Veeramachaneni Wampler Wilson 



BigData 

— TECHCON 



www.BigDataTechCon.com 


April 8-10,2013 
Boston, MA 

































Investors are all about the enterprise 

Groupon, Zynga IPOs sour taste for consumer startups 


BY ALEX HANDY 

The enterprise is sexy again. While big- 
dollar investors for much of 2012 chased 
after the likes of Groupon, Zynga and 
even the colossal photo share failure of 
Color, investors are once again turning 
to the enterprise world as a more lucra¬ 
tive place to build a business. 

Carl Rydbeck, vice president at 
investment firm Crosslink Capital, said 
that his company has investors focused 
specifically on both consumer and busi¬ 
ness markets, and that the firm does not 
pick and choose only one or the other. 
“That said, I think there are lots of rea¬ 
sons to be excited about enterprise 
right now,” he said. 

“There’s more innovation there right 
now than there has been in a long time, 
both in terms of business models, like 
SaaS, freemium and recurring revenue, 
as well as technology. Some of these 
changes aren’t new, but they’ve become 
mainstream over the past couple of 
years, along with cloud.” 

Crosslink Capital’s most famous 
investments were decidedly consumer: 
Pandora and Tivo. But the firm also 
recently invested in DataStax, the 
NoSQL company based on Cassandra. 
And while this investment was a tough 
decision at first, the results and growth 
have been extremely encouraging for 
Crosslink. 

This investment interest is somewhat 
cyclical, with the 2000 dot-com boom 
being the last major era of enterprise 
investment enthusiasm. This time, how¬ 
ever, things have changed, said Rydbeck. 

“I think there’s renewed recognition 


now that the enterprise markets are 
enormous in terms of the aggregate dol¬ 
lar amounts that are spent,” he said. “At 
that point [in 2000], there were less con¬ 
sumer dollars spent online, so building 
out IT infrastructure was naturally a 
huge piece of the bubble. If you look at 
the whole economy, consumer markets 
are bigger than B2B markets, but the 
online IT market during the dot-com 
boom consisted of a lot of companies 
building a lot of infrastructure ahead of 
consumer demand. In the mid 2000s, e- 
commerce and the consumer Internet 
continued to grow and caught up, and 
the investors realized how big those mar¬ 
kets were, leading to a lot of investment. 

“I think in large parts of IT, there 
was a dot-com or telecom bust that 
soured investors on certain enterprise 
markets for a while. Meanwhile, the 
consumer Internet continued to grow 
and all these consumer plays got enor¬ 
mous traction, but it’s overly simplistic 
to say that’s it’s a back-and-forth cycle. 
Another key driver of enterprise invest¬ 
ing these days is that for the most part 
exit multiples in the public markets 
have been very attractive.” In layman’s 
terms, that means enterprise compa¬ 
nies that make it to the stock market 
earn lots of money for their investors, 
or a multiple of their initial investment. 

More money in the enterprise 

Those multiples have certainly been 
lucrative even before deals hit the public 
markets. Heroku’s US$250 million 
acquisition by Salesforce, and the fren¬ 
zied investment into new Big Data solu¬ 


tions, are results that could only exist in 
a business world; Big Data isn’t exactly 
relevant to the consumer. And Big Data 
is another area of enterprise technology 
that’s booming thanks in part to NoSQLs 
and the Apache Hadoop project. 

Shanley Kane, director of product 
management at Basho (the company 
behind NoSQL key-value store Riak), 
said that the real reason to be in the 
enterprise market is fairly obvious: It’s 
the money. “Our average deal size is 
much higher than some of our competi¬ 
tors,” she said. “We start out in the six- 
figure range in terms of what our aver¬ 
age deal size is. The number of startups 
that can afford that are close to zero, 
but that’s just a starting deal in the 
enterprise. Enterprises spend millions 
if not billions on their infrastructure.” 

Herb Cunitz, president of Hadoop- 
focused company Hortonworks, said that 
the cyclical nature of investment interest 
in enterprise businesses is nothing new. 
Instead, what’s new is the data available. 
“I’ve been in the industry long enough to 
see this come and go multiple times,” he 
said. “We are seeing that again. 

“Hadoop is one of the technologies 
that is driving this, but it’s not the only 
one. It’s simply a function of the market, 
and the availability of data that didn’t 
exist two to three years ago, and what 
innovators are doing to leverage that info 
and make smarter and faster decisions. 
It’s driving change in the data market, 
and it clearly sits on the enterprise side. 

“I do believe it is making a change, 
and if it plays out as we all think it will, 
this will have very far-reaching implica- 
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tions on how many companies go to mar¬ 
ket, what operating systems and data 
platforms and warehouses they use, as 
well as how end-user companies use it in 
their enterprise to gain advantage. There 
haven’t been many of those transforma¬ 
tional technologies for a couple years.” 

Combine this with the lackluster 
showings of recent IPOs for Groupon 
and Zynga. While both companies saw 
early boosts in stock price, those same 
stocks are now a fraction of their former 
selves. Groupon, once trading above 
$25 a share, is now around $4.50, while 
Zynga s former high above $15 per 
share is now a long way away from its 
price of around $2.50 as of this writing. 

Meanwhile, enterprise-focused busi¬ 
nesses are doing very well. AppDirect, 
an online business application store. 
Daniel Saks, president and CEO of 
AppDirect, said that the business of 
enterprise software is on fire again 
thanks to innovation in what have tradi¬ 
tionally been rather dull markets. 

“In the past, enterprise wasn’t hot, 
and it really sucked,” he said. “Then we 
saw all these companies like Rackspace 
and Box who bring this great attitude 
toward the enterprise space, and we 
wanted to take it a step further.” 

To that end, said Saks, his company 
created an app store for business SaaS 
applications. AppDirect gives users a 
single place to pay all their SaaS bills, 
and it gives service providers such as 
AT&T a generic app store to brand and 
open to their users. 

“There are so many cool develop¬ 
ment tools today. There’s Heroku and a 
lot of cool platforms; a lot of developers 
don’t want to work in these restrictive 
ways,” said Saks. That’s because devel¬ 
opers in enterprises now have one thing 
they’ve never really had before: choice. 
While vendor lock-in was all the rage 


for development tools in the 1990s, 
today’s tools tend to be open source and 
interoperable, allowing developers to 
use what they want, when the want. 
That also means enterprises themselves 
aren’t locked in to old platforms and 
systems that could be dragging down 
productivity. 

The ghost of Color still haunts 

But even without a failed IPO, con¬ 
sumer startups are struggling in the cur¬ 
rent economy. A year ago, a new mobile 
phone startup called Color was hitting 
home runs in the venture capital com¬ 
munity, and shortly thereafter, in the 
media. Before the company’s product 
had even launched, Color had earned 
$41 million in investment capital. 

Just prior to launch day for Color, 
news agencies around the world foretold 
the coming of a messiah-like new compa¬ 
ny, without any information on what it 
did. Local TV news stations ran stories 


NEWS i is 

Despite tremendous investment and 
hype, Color closed down in 2012, with¬ 
out so much as a single success story. The 
company’s failure was a black mark for 
Sequoia Capital, the firm behind eHar- 
mony, Google, Linkedln and Yahoo. Last 
year, Color sold off its engineering 
department to Apple for $2 million. 

Compare these failed IPOs and 
crumbled consumer startups with the 
enterprise-focused data indexer 
Splunk. While Splunk’s actual financials 
aren’t terribly different than Zynga’s 
(both companies have a negative earn- 
ings-per-share ratio at the moment), 
Splunk remains a popular stock on Wall 
Street while Zynga continues to plum¬ 
met toward crisis. And while Splunk 
continues to hire and expand opera¬ 
tions, Zynga is laying off workers. 

Rydbeck believed this is a result of 
the realization in the investment com¬ 
munity that open-source business soft¬ 
ware and the solutions that come out of 


Venture capitalists rush for the consumer exits 


Eager to avoid a repeat of the dot-com 
bubble in the enterprise market, venture 
capital went in hard in the consumer 
space. But, apparently, the bubble still 
followed them. 

In 2009, when FarmVille was one of 
the biggest things on Facebook, Zynga 
was able to pull in $25 million in invest¬ 
ments from such firms like Kleiner 
Perkins and Avalon Ventures. Over the 
next two years, Zynga collected over $1 
billion. But, as Facebook fell, its stock 
price dropped approximately 75% since it 
was introduced in December of 2011. 

Not to be outdone, coupon-offering 

about the forthcoming launch of a mega¬ 
startup, and non-technology-related out¬ 
lets predicted Color would change every¬ 
thing for consumers. 

Then Color launched. The applica¬ 
tion allowed users to share photos 
based on their location. Take a photo of 
the White House, and anyone who 
stands nearby can see it. Lrom afar, it 
sounds like a fairly silly idea: Why 
would anyone want to see photos of a 
landmark taken by strangers, especially 
when they’re already standing within 
viewing distance of said landmark? 


service Groupon made a huge splash in 
January 2011 when it raised $950 million 
from (again) Kleiner Perkins and Silver 
Lake. By 2012, with Groupon's stock price 
crashing through floor after floor, venture 
capitalists like Marc Andreessen had all but 
abandoned the company. Since its block¬ 
buster $950 million round of funding, it 
only raised about $18 million. (Don't worry 
about the early investors, though; they still 
made some money when they cashed out.) 

Fears of a repeat of the dot-com bubble 
didn't keep investors from jumping in, but 
now it's driving them out sooner rather 
than later. 

Lacebook and Google can be turned 
into lucrative businesses. 

“I’m nowhere near the first to make 
this observation,” he said, “but there’s a 
pattern of the giant Web companies like 
Google, Amazon, Lacebook and Twitter 
encountering data and computing 
problems before the rest of the world 
and solving them internally. And once 
there’s a solution, it often becomes 
open-sourced, and then people realize 
the technology can have applicability to 
broader use cases within business. 
That’s becoming widely recognized.” I 
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Application Developers Alliance 
stakes out ground in first year 

Policy advocacy, resource creation provide support for coders 



BY SUZANNE KATTAU 

The first year of the Application Devel¬ 
opers Alliance, the Washington, D.C.- 
based organization that was formed, in 
part, to provide representation for soft¬ 
ware engineers and application devel¬ 
opers, has proven quite productive. 

Founded in January 2012, the 
Alliances membership has risen to 
more than 20,000 individual developers 
and about 120 companies. President Jon 
Potter and senior vice president Jake 
Ward said they cofounded the Alliance 
because they believed there was a real 
need for coders’ voices to be heard. 

“We’d seen representative organiza¬ 
tions pop up around the country that 
represent the companies that employ 
developers,” Ward said. “Those organi¬ 
zations focus on the software as an 
industry, without recognizing the 
importance of the individual coders.” 

The Alliance helps developers with 
what Potter said was developers’ most 
pressing need right now: to know how 
to take an idea or app and turn it into a 
business. This is true for developers at 
startups, he said, as well as for those at 
brands that are trying to extend their 
marketing reach or deliver products 
through a mobile setting. 

“The questions we hear are, What 
are my revenue streams? What are my 
options? Who do I talk to? What’s been 
done? What’s succeeded?’ ” said Potter. 
“And that information is the type of 
thing that we have bundled together 
and put into DevsBuild.lt, our resource 
center that is a library of content for our 
members, from our members, and 
curated and created by the Alliance.” 

DevsBuild.lt was launched in 
December 2012, is still in beta, and is 
free to individual developers. The 
Alliance said it answers development- 
related questions that developers have, 
but it doesn’t address coding at all. “We 
like to say it’s everything but the code,” 


Ward said. 

The Alliance hosted more than 50 
events in 2012, including a countrywide 
mobile app series. It said it plans to host 
a separate series this year on software 
patents, as well as additional confer¬ 
ences, summits, workshops, meetups, 
hackathons and webinars. 

Four working groups were also 
launched in its first year. The groups 
focused on business and revenue; emerg¬ 
ing technologies and research; health 
and medical apps; and mobile payments. 
They are comprised of Alliance members 
and shape Alliance programs, policies 
and resources. They provide core expert¬ 
ise and guidance to the Alliance and its 
members through commissioned 
research, white papers, policy recom¬ 
mendations and best practices proposals. 

Two more working groups—focus¬ 
ing on kids and education, and media 
and entertainment—are scheduled for 
launch this year. 

The Alliance has also helped devel¬ 
opers last year, Potter said, by providing 
advocacy in Washington, D.C., on poli¬ 
cies involving privacy of data in mobile 
apps and on mobile devices. He also 
said they worked closely with consumer 


groups around consumer privacy dis¬ 
closures. In 2013, the Alliance plans to 
continue providing advocacy on these 
and other emerging software develop¬ 
ment topics. 

Individual developers can join the 
Alliance for free by visiting 
AppAlliance.org. Membership gives 
them access to the Alliance’s members- 
only DevsBuild.lt resource. There is a 
fee for companies to join the Alliance, 
with a pricing structure based on num¬ 
ber of employees within their organiza¬ 
tion. Last year, for example, Potter said 
that the cost to join ranged from 
US$200 to $75,000. 

All corporate members receive the 
same benefits regardless of size of their 
organization or the annual fee they pay, 
Potter said. Alliance corporate mem¬ 
bers receive a dedicated page on Devs- 
Build.lt to customize content and dis¬ 
tribute materials to the Alliance’s 
members. Other corporate benefits 
include business development opportu¬ 
nities, marketing support, events and 
programming, market research and 
intelligence, participation in Alliance 
working groups, service discounts and 
policy advocacy. I 
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Apache Cassandra 1.2 packs in the nodes 

Atomic batches, updated query language also featured 


BY ALEX HANDY 

The Apache Cassandra project, on the 
second day of the new year, announced 
the release of version 1.2 to the public. 
This revision adds a number of updated 
and new features, including support for 
virtual nodes, atomic batches, and a 
new version of the Cassandra Query 
Language. 

For administrators using Cassandra, 
losing a node can be a painful experi¬ 
ence. The problem isn’t that a lost node 
hampers read or write performance— 
Cassandra scales to hundreds of nodes 
and beyond, so losing a single server isn’t 
a huge deal. The real problem is that 
when a node is lost and must be rebuilt, 
the sheer size of the data involved can 
make for a lengthy rebuild process. 

But the advent of virtual nodes in ver¬ 
sion 1.2 helps to alleviate this problem, 
said Jonathan Ellis, CTO and cofounder 
of DataStax, a company built to support 
and evolve Apache Cassandra. 

“One of our goals for 1.2 was to sup¬ 
port denser machines,” he said. “We’ve 
always targeted a very scale-out kind of 
deployment with Cassandra, where you 
have machines with relatively small 
amounts of memory, but the price per¬ 
formance hot spot has been favoring 
multiple [terabytes] of disk and 64GB 
of RAM per machine. 

“Specifically, virtual nodes address 
the time to rebuild a dead machine. If 
you have a catastrophic failure and you 
have to rebuild a node entirely, in the 
past we’ve been able to stream data to it 
from three or five machines in the clus¬ 
ter. If you have 100 machines, you’re 
only parallelizing that by a fraction of 
what you could. Virtual nodes let you 
spread out ranges with which you share 
data with other nodes in the cluster. It 
supports up to 256 virtual nodes per 
machine. The idea is to parallelize the 
rebuild across the entire cluster.” 

Thus, a rebuild of a dead node 
should take place in an hour rather than 
a day, said Ellis. 

Additionally, inversion 1.2, Cassandra 


Query Language (CQL) was updated to 
version 3. This structured query lan¬ 
guage was designed, said Ellis, to help 
make developers experienced with rela¬ 
tional databases feel more comfortable 
building with the Cassandra NoSQL. 

“With CQL 3, we’re trying to 
address the pain points we see for peo¬ 
ple designing an application with Cas¬ 
sandra, especially for those coming 
from relational databases,” said Ellis. 
“We’re abstracting the Cassandra stor¬ 
age engine away from the query lan- 



DataStax's Jonathan Ellis says Cassandra 1.2 
is accessible to relational DB developers. 


guage. You can cluster your rows in a 
partition, and that’s a very natural way 
of thinking about the data for someone 
from the relational world. You can also 
do a collection of data types within a 
column, which makes it easier to do 
columnar forms of normalization.” 

That’s an important improvement, as 
NoSQLs generally frown upon joins. 
“When we said ‘no joins,’ a lot of people 
took away from that ‘Cassandra isn’t 
going to do joins, I should be doing 
joins in my application code.’ That’s not 
what we want,” said Ellis. “Doing those 
joins is going to touch lots of machines 
in the cluster, and that’ll kill your per¬ 
formance.” 

Thus, instead of doing joins, Cassan¬ 
dra emphasizes normalization. “Let’s 
say you want to allow users to have mul¬ 


tiple e-mail addresses,” said Ellis. “In a 
relational database, you’d create a sepa¬ 
rate e-mail entry and have a many-to- 
one relationship. In Cassandra, we’re 
allowing you to have a set of e-mail 
addresses as strings, so you can pull 
those out and not have to worry about 
having a separate table or anything.” 

Next on Cassandra's plate 

With Cassandra moving forward and 
gaining traction inside enterprises, Ellis 
said there are a few things on the hori¬ 
zon that should be nearing their time of 
inclusion into the project. The Cassan¬ 
dra mailing list took the release of 1.2 as 
a chance to chat about version 2.0, and 
he had some insight into what should 
be in store for the project throughout 
the rest of the year and beyond. 

“A few things have been on our wish 
list for a while, and 2.0 is going to be a 
good time to make those happen, partly 
because we’ve laid the groundwork in 
1.2,” he said. “We’ve had triggers on our 
wish list for at least a year, probably 
two, and it’s a tough thing to do in an 
eventually consistent system like Cas¬ 
sandra. We have a proof of concept that 
has raw triggers that allow you to run a 
piece of code you’ve injected via JARs 
somewhere. It’s not user-friendly, but it 
shows the concept.” 

One area Ellis would like to see 
improved is the client ecosystem for 
Cassandra. “We’re very interested in 
making the client experience a more 
seamless and positive one rather than 
the way it is now,” he said. 

“If you go out there and look, there 
are a half-dozen Java clients, two Python 
clients, etc. There are all these clients out 
there that the open-source community 
has put together to make it easier to work 
with Cassandra, but for someone who’s 
just gone through the process of choos¬ 
ing which datastore to use, he doesn’t 
want to now go and choose which client 
to write his application with. We’re 
focused on providing best-of-breed 
clients for customers to run with.” I 







20, NEWS 


I SD Times | February 2013 | 


www.sdtimes.com | 



As Miguel de Icaza's brainchild matures, the vitriol 
between the camps seems to have lessened, for now... 


BY CHRIS BARYLICK 

It all comes down to the tools and appli¬ 
cation programming interfaces you 
have to work with. 

At the dawn of the 2000s, with boy 
bands ruling the day, people thinking 
nu-metal would last forever and no one 
quite knowing what the next decade 
might bring, Microsoft pulled away 
from the traditional architecture it had 
used for the Windows 95 and 98 oper¬ 
ating systems and moved toward a 
wider consolidation of its file systems 
and libraries with the .NET Framework 
project. Such an architecture, which 
sought to encapsulate all the functions 
of security, memory handling, interop¬ 
erability, virtual machine operation, 
exception handling, library manage¬ 
ment and support for assorted pro¬ 
gramming languages, became some¬ 
thing of a leviathan, but one that 
controlled a good deal of how the Win¬ 
dows operating systems behaved and 
how to write for Microsoft’s platforms. 

If there’s a problem to be had with 
.NET, it’s the same problem you’ll find 


with any technology that’s an instru¬ 
mental part of any corporation: It tends 
to be available for that company’s prod¬ 
ucts and not much else. And as terrific 
as .NET has become, it’s still largely 
tied down to the Microsoft family of 
operating systems with few, if any, 
expectations that Microsoft might one 
day release its golden geese for non- 
Microsoft operating systems to enjoy. 

Into this breach stepped Miguel de 
Icaza, formerly of Novell and now with 
Xamarin, who’s been working for the 
past eight years to bring .NET to the 
masses via the Mono Project, a set of 
open-source development tools that 
work to ensure that developers are able 
to run .NET applications across a wider 
array of operating systems. (The current 
list includes Android, BSD, iOS, Linux, 
OS X, Solaris, Unix and Windows, as 
well as assorted game console such as 
PlayStation 3, Wii and Xbox 360). 

The Mono Project has done what the 
open-source movement is great at doing: 
taking something that’s closed off, 
reverse-engineering it, and releasing it 


to as wide an audience as possible—in 
this case, with considerable success. 

Similar to .NET in many ways, the 
Mono Project has stated that its current 
aim is to support the feature base found 
in .NET 4.5—except for the Windows 
Presentation Foundation, Windows 
Workflow Foundation and Windows 
Communication Foundation file for¬ 
mats. (Additional efforts to support 
these formats are under way via the 
Olive libraries.) 

Over the past several years, Mono 
has begun to come into its own, with the 
current version incorporating many of 
the elements of the .NET Framework. 

Branches on Mono's tree 

Like .NET, the Mono Project has added 
support for a wide variety of program¬ 
ming languages, including C, C++, 
Python, Java and Vala. The runtime ver¬ 
sion also shipped with two garbage col¬ 
lectors that work together to appropri¬ 
ately free up resources between the 
application and operating system. 

continued on page 22 ► 
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Mono, Microsoft build a bridge 


◄ continued from page 20 

Other projects that have spun off 
from the Mono effort include open- 
source media players; Android OS builds 
for Droid handsets as well as Apple’s iOS 
devices; cross-compatible Cocoa and C# 
development tools for the OS X plat¬ 
form; game-creation tools; and cross¬ 
compatible plug-in developments 
between the Mono Project and Micro¬ 
soft’s Visual Studio application (some¬ 
thing of an informal alliance, but still 
readily available to those who want it). 

Which brings about the most perti¬ 
nent question for the development 
community: Where do Microsoft and 
the Mono Project stand in relation to 
each other, especially given that Mono 
somewhat exists to reverse-engineer a 
Microsoft property, thus bringing simi¬ 
lar functionality to non-Microsoft oper¬ 
ating systems and devices? 

“In the last decade, we went from a 
monocultural world dominated by Win¬ 
dows to a multiplatform world,” said de 
Icaza. “I know that you might be think¬ 
ing, 'Well, the guy works on Mono,’ and 
it is a common reaction by those that 
are not familiar with .NET. But the 
framework and the ecosystem are a 
delight to use, and you soon realize that 
there is some top-notch engineering 
going on its design. 

“We started the work on the Mono 
Project since we were frustrated with 
low-level languages to build large appli¬ 
cations, and at the time, high-level lan¬ 
guages were just too slow to power 
those applications. So C# and .NET 
were a good fit, a balance of perform¬ 
ance and high-level features.” de Icaza 
went on to compliment .NET’s progress 
over the years. 

“Luckily, the framework has evolved 
over time,” he said. “.NET is not the 
same beast that it was back in 2001. It 
has kept up very well with the new 
understanding about the world. .NET 
brought full generic programming 
sometime around 2003, then the C# 
language went through iterations that 
added, first, generics support, then 
functional programming constructs, 


then dynamic constructs, and most 
recently support for beautifully han¬ 
dling asynchronous programming. 

“.NET was designed very much at a 
time when Windows was king, so every 
once in a while there is an API that is 
sadly designed with Windows in mind. 
Luckily, we have been able to replace 
those in many cases with cross-platform 
ones, or they have become deprecated. 
Although there are things that I would 

Supported elements in Mono 

■ Visual Basic .NET-compatible compiler 

■ Moonlight ("Silverlight" variant) 

■ C# compiler 

■ Virtual machine 

■ Development tools for graphical user 
interface development (Gtktt, etc.) 

■ Integrated debugger 

■ Mozilla-compatible libraries 

■ Security stack 

■ Database connectivity libraries 

like to fix in the core, purely from a per¬ 
fection perspective, they are livable. 
And the good things that have been 
built around it make up for the few mis¬ 
takes here and there in the core.” 

Microsoft's handling of open source 

At this point, there’s something of a 
truce, and even Microsoft is reaching 
out to the open-source community to a 
small extent. The Mono Project cur¬ 
rently falls under the Microsoft Com¬ 
munity Promise, which functions as an 
established covenant to not assert legal 
rights or action over certain Microsoft 
patents on implementations. As such, 
there exists some wiggle room within 
Microsoft’s patents, and Mono’s code, if 
found in violation of these patents, 
could be altered so as not to infringe 
upon Microsoft’s legal claims. 

To date, Mono seems to exist in a 
gray area and has even been regarded 
askance within the Linux community. 
Back in 2009, it was unknown as to 
whether to include elements of the 
Mono Project within Fedora 12—Red 
Hat’s Linux distribution—given its legal 


between them 

status. Fedora project leader Paul 
Frields has stated that “We do have 
some serious concerns about Mono and 
we’ll continue to look at it with our legal 
counsel to see what if any steps are 
needed on our part,” yet that the Fedo¬ 
ra Project has “yet to come to a legal 
conclusion that is pat enough for us to 
make the decision to take Mono out.” 

“Microsoft has already moved 
libraries like Entity Framework into 
open-source development,” said Bran¬ 
don Bray, group program manager of 
Microsoft’s Managed Languages team. 
“Right now, Microsoft has no tooling for 
(non-Microsoft operating systems). It’s 
good to see Mono filling this space. In 
fact, C# is probably the most portable 
programming language on phone plat¬ 
forms today because of Mono,” he said. 

“The downside of having such a 
large programming API and being on 
so many platforms is that developers 
have to spend a lot of time hunting for 
libraries or tutorials,” he said regarding 
the current scale of the .NET effort, 
and where Mono and open-source 
efforts fit into this. 

Bray then went on to endorse the 
NuGet effort, which functions as an 
open-source package manager for the 
.NET Framework and is distributed as 
a Visual Studio extension. 

“Mono has been around since the 
standardization effort for .NET began 
in ECMA,” Bray said. ECMA has creat¬ 
ed a standard that defines the Common 
Language Infrastructure (CLI), in 
which applications written in multiple 
high-level languages can be executed in 
different system environments without 
the need to rewrite those applications. 
“The Mono Project has been a great 
example of cross-platform capability,” 
he added. “Microsoft has frequently 
collaborated with the Mono team.” 

Where developers themselves were 
concerned, feelings toward .NET 
seemed somewhat ambivalent, albeit 
some wished for a wider support basis 
from Microsoft regarding the Mono 
Project. 

continued on page 24 ► 
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Mono, Microsoft build a bridge between them 


◄ continued from page 22 

“I haven’t used Mono for anything 
production-related, mostly just quick 
testing or trying to compile something 
needed for Linux,” said Tim Dona- 
worth, a security engineer for Gemini 
Security Solutions. “But I honestly think 
it’s a great thing. I wish MS would fur¬ 
ther open the framework. I don’t know 
if they support Mono, but from what 
I’ve heard, they’re not trying to stop it.” 

Donaworth went on to praise Micro¬ 
soft’s support of additional libraries to 
.NET, which have helped him in his 
work. “I think the addition of LINQ was 
the biggest change for me though. Being 
able to take an object-oriented approach 
to databases, or even to use previous 
SQL experience on objects, now made 
managing data a breeze.” 

Building an F# bridge 

If a grand compromise exists between 
Microsoft’s .NET Framework effort 
and the Mono Project community, it 


can be found in the F# programming 
language, which de Icaza called “cur¬ 
rently one of my favorites.” The F# 
tools, which can be downloaded from 
the F# Software Foundation, function 
as an open-source, cross-platform CLI 
language that can also generate 
JavaScript and GPU code when neces¬ 
sary. The language, which was devel¬ 
oped via a collaboration between the 
F# Software Foundation and Microsoft, 
is currently supported in Microsoft’s 
Visual Studio development along with 
Mono, MonoDevelop and SharpDevel- 
op. It also functions on the Android, 
iOS, Linux, OS X and Windows operat¬ 
ing systems. 

Even if Mono has yet to completely 
break out and become the de facto pro¬ 
gramming standard for all, it’s presently 
finding its way into some useful tools. 
Per Bruce Morrison, producer at video 
game development company Man Up 
Time, Mono has had elements used in 
the Unity game development engine, 


which Morrison has come to know and 
love in his day-to-day work developing 
games for the iOS platform. 

“Unity is extremely good,” said Mor¬ 
rison. “It has some technical limits, but 
the power and speed of development it 
brings can’t be beat.” 

Morrison went on to add that he sup¬ 
ports the idea of Mono helping .NET 
Framework capabilities reach more plat¬ 
forms, especially iOS. Still, he felt the 
integrated development environment in 
Mono Develop could stand to see some 
improvement in future versions. 

Other developers sided with 
Microsoft’s .NET Framework architec¬ 
ture but still liked how the Mono Proj¬ 
ect was evolving as a development tool. 
“I’ve made a couple attempts at getting 
my game to compile with MonoGame, 
but I haven’t had any luck yet,” said Ian 
Stockton, a self-taught game designer 
and developer with experience in both 
.NET and Mono, with a preference for 
the C# programming language. “One of 
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'Microsoft has no tooling for non-Microsoft 
operating systems. It's good to see Mono 
filling this space. Ctt is probably the 
most portable language on phone 
platforms today because of Mono.' 



these days, I’ll need to sit down with 
someone who knows his way around the 
framework so I can get it working. In 
terms of being cross-platform between 
Xbox and PC, XNA is amazing. Aside 
from some resolution and file system 
things, it’s almost automatically cross 
platform.” 

Stockton then detailed the changes 
he’d like to see made with the Mono 
Project, especially within the realm of 
getting new developers acclimated to 
the technology. “An easier installation 
process would be nice,” he said. “When 
I tried it last, I think I had to install 
about five different programs just to 
access the codebase. A walkthrough for 
getting the examples to compile would 


also be great. 

“XNA has spoiled me. Now I feel 
entitled to zero-friction new projects 
and deployment. If only MonoGame 
were so simple!” 

The relationship between Microsoft 
and Mono is and has always been a 
strange one (if not incendiary at times), 
complete with all the hesitancy, caution 
and fervor of a very large software com¬ 
pany defending a key technology, and 
the open-source community doing what 
it can to bring .NET’s feature set to its 
operating systems of choice. 

Even so, there seems to be a bridge 
slowly forming between the two camps. 
In eight years’ time, Microsoft has 
respected its Community Promise, left 


Mono alone and seems to be collaborat¬ 
ing well with de Icaza’s brainchild, even 
if there are mutterings among develop¬ 
ers that they’d like to see more tools 
from Microsoft available to the Mono 
side of the camp. The Mono Project, in 
turn, has thrived, created what tools it 
could to work directly with the .NET 
Framework architecture, and cranked 
out some amazing open-source tools 
with no signs of slowing down or stop¬ 
ping. 

No, the relationship between the 
two sides isn’t perfect, nor will it ever 
be. The two efforts serve different 
needs and different creators, even if 
their goal of creating killer developer 
tools for their respective user bases is 
the same. The bridge between the 
.NET Framework and Mono architec¬ 
tures may never be complete, but it is 
gradually extending, with both sides 
offering a maturing set of tools to work 
with as well as collaborating with the 
other side’s tool set. 

And that’s good news to any develop¬ 
er looking for a better means of finish¬ 
ing his or her project. I 
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Changes in form and behavior 
drive changes in development 

Mobile apps need flexibile interaction options built in 


BY SUZANNE KATTAU 

A wide range of new client form factors, 
plus changes to the way people will use 
mobile applications, will increase devel¬ 
opers’ need to build flexibility into their 
applications, according to “The Future 
Of Mobile Application Development,” 
a Forrester report that was released on 
Jan. 17. 

The report, written by analysts Jef¬ 
frey Hammond and Julie Ask, explains 
how modern mobile applications will 
evolve over the next few years and what 
development teams should prepare for 
now. If you’re a developer looking to 
build modern applications that will 
deliver contextual customer experi¬ 
ences, there are UI changes coming 
that you will need to understand, 
according to the report. 

“When developers design user 
interfaces today—whether it’s for a 
laptop or for a mobile device—we 


think that that application is totally 
what the user’s going to be focused 
on,” said Hammond, principal applica¬ 
tion development and delivery analyst 
at Forrester. “So we try to design the 
applications to catch the user’s eye, to 
involve them, to totally engross them. 
But think about what’s going to hap¬ 
pen when they have to be able to see 
the road or make sure that they don’t 
trip and fall while they’re getting cues 
from these applications.” 

Hammond said the way developers 
design apps is going to become much 
more of a peripheral interaction than 
the main focus of interaction. “That’s 
going to be a bigger shift for developers 
than we think because we’re used to 
having the exclusive attention of the 
user,” he said. “Soon, that’s not neces¬ 
sarily going to be the case.” 

UIs are going to change focus from 
touch to voice, according to the report. 


“When you strap that device onto your 
arm (before a run, for example), you’re 
not going to look at it anymore until 
you’re done with your run. Voice takes 
over,” Hammond said. “Also, when 
you’ve got applications that are in the 
car, the natural mode also shifts to 
voice. But you don’t only want to have 
voice because, if you’ve got an applica¬ 
tion and you’re in a crowded restau¬ 
rant, you’re going to want to shift back 
to touch. So developers have got to be 
able to essentially support multiple 
modes of input.” 

Hammond said that developers 
need to even understand which mode is 
going to be the preferred mode, 
depending on the situation. He said 
there’s no reason for developers to not 
take advantage of the microphone that 
detects ambient sound and says, “Oh, 
well, with the decibels here, I can’t real- 
continued on page 28 ► 


What developers can take away from the report 


Flexibility in architecture is key: There's still an awful lot of 
change over the horizon, according to Hammond. “We're not at 
the end state, where all developers have to do is worry about 
writing Android apps or iOS apps," he said. “That may keep you 
for the next year and even the next two years, but there are 
more devices coming and we're probably going to have more 
than just two operating systems that we'll have to care about on 
the client. So, flexibility in architecture is key, as is understand¬ 
ing that these things are connected systems." 

When you have the option of trading flexibility for speed, said 
Hammond, you need to think very carefully about that. “Folks 
need to think very carefully about where it makes sense to do 
platform-specific optimizations and where maybe Web technol¬ 
ogy can help them save a little bit of money so that they can hit 
the maximum number of platforms," he said. 

Functionality will blend into the platform itself: “I think one of 
the things that developers should understand is that the nature 
of the app itself is changing," said Hammond. “We're starting to 
see the functionality blend into the platform itself. So, whether 
it's something like the Apple Passbook where, if you're a brand, 
your loyalty cards now show up in the Apple Passbook, or 
whether it's Google Now where, if Google makes the APIs avail¬ 


able to use their voice recognition, or 
if Siri does the same sort of thing, now 
you're just tapping into the device." 

A trend that Hammond sees con¬ 
tinuing is the one in which mobile 
apps become a little bit less single and 
modular, and instead integrate more 
tightly with the platforms themselves. 
“So this idea that I'm creating my own 
standalone app and it's one of 
500,000 in the AppStore and it's 
impossible for people to find, gets 
replaced," he said. “It gets replaced 
by, what sort of context, what sort of 
services am I tapping into, and how do 
I participate in that customer experience?" 

It will be about more than voice and touch: “I think the biggest 
thing that developers will need to keep in mind is that mobile 
development won't be just about what the eyes see or what the 
fingers touch. It will also be about what the voice says, what the 
ears hear and how they all work together," Hammond said. 

—Suzanne Kattau 



Architecture flexibility is 
another big trend, says 
Forrester's Hammond. 
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Changing forms drive changes in development 


◄ continued from page 26 

ly understand voice input so I’m going 
to automatically shift to text input or 
touch input instead.” 

Wearables and connectables are 
definitely coming in the near future, 
the report said. “I don’t know if you’ve 
seen some of the prototypes that have 
been shown around on batteries that 
are flexible,” Hammond said. “So you 
can conceivably have a battery woven 
into your jacket and be charging your 
cell phone all day long with a 10-ounce, 
flexible battery that’s in the back panel 
of your jacket. Or imagine shoes that 
can generate power while you’re walk¬ 
ing around. If you’ve got that battery 
panel hooked up to little power gener¬ 
ators in your shoes and you go for a 40- 
minute run, you’ve just generated 
enough power to power your cell 
phone for the rest of the day.” 

There’s also no reason we can’t see 
these mobile devices change pretty rad¬ 
ically in form factor, Hammond said. 


Mobile app development 
changes a-coming 

■ Apps will augment voice input and 
prioritize it over touch. 

■ Uls will need to adapt to heads-up 
interfaces. 

L Touch input devices will require adap¬ 
tive Uls to deal with multiple device 
sizes. 

■ Standalone mobile apps will evolve 
into pluggable mobile services. 

■ Wearables and connectables will her¬ 
ald a fast-changing local network of 
customer context. 

■ Better Web support will advance the 
economics of the hybrid application 
model. 

Source:"The Future Of Mobile Application 
Development,"Forrester, Jan. 17,2013 

“Why do you have to have the 4-inch 
device up by the side of your face?” he 
asked. “If you have a pair of glasses on, 
who’s to say you can’t have a brick sit¬ 
ting in your briefcase or your attache 


that’s doing most of the processing and 
the glasses are essentially the local 
dumb terminal, if you will. It’s kind of 
like the fifth generation down from a 
3270 terminal.” Hammond said that by 
2014, those types of glasses will defi¬ 
nitely be available. 

Hammond wrote a blog post to go 
with the report, stating that the best 
way for developers to develop mobile 
apps is, well, not to develop mobile 
apps. He was just making the point, he 
said, that these apps are going to get 
very large and very complex. 

“It’s the new way of building,” he 
said. “It’s almost like how we really 
don’t talk about client/server apps 
anymore, or Web applications; it’s just 
how we build apps now. We just 
assume that that’s the architecture. I 
think, if you were to go five years in 
the future and look back at today, at 
how people are building apps now, 
they’re going to be building applica¬ 
tions this same way.” I 
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Divergent views of supplemental compute 

Xeon Phi co-processor illustrates different strategies of Intel and nVIDIA 


BY ALEX HANDY 

After five years of work on CUD A for 
supplemental compute, nVIDIA is 
squaring off with the biggest chip-mak¬ 
er in the world. These two titans of pro¬ 
cessing cores are both pushing PCI-X- 
based methods of expanding the 
compute power of desktops, but their 
respective approaches and solutions 
couldn’t be any more divergent. 

The new Intel Phi co-processor is 
quite different from an nVIDIA GPU. 
The Phi is a MIMD (Multiple Instruc¬ 
tion, Multiple Data) machine, while the 
GPU is SIMD (Single Instruction, Mul¬ 
tiple Data) machine. The Phi is useful 
only for HPC, while the GPU can run 
games and graphical simulations as 
well. The Phi runs its own Linux and 
presents as a cluster, while GPUs are 
treated as on-device equipment and 
managed alongside the CPU. 

But beyond the technical differences, 
the two companies are already showing 
that they have completely different 
approaches to the growing HPC market. 
Intel has focused on bringing its existing 
compilers and tools to this new process¬ 
ing platform, while nVIDIA has spent 
the past five years building a community 
around its CUD A platform, and around 
GPU compute in general. 

Ian Buck, general manager of GPU at 
nVIDIA, said that “Fundamentally, for 
HPC, it’s about expressing the paral¬ 
lelism.” To that end, developers using an 
nVIDIA GPU to run their code can gen¬ 
erate a single thread, and have it repli¬ 
cated across the GPU cores into the tens 
of thousands of threads. He said this par¬ 
allelism is easy to get with CUD A. 

“If you try that on 60 cores on a chip, 
it’s a little less clear how you program 
it,” said Buck, referring to Intel’s Phi. 
“You can treat it like an MPI (message¬ 
passing interface) cluster on a chip. The 
challenge of that programming model is 
each core is wimpy on its own: It’s a 
Pentium Pro-type processor.” 

But James Reinders, director of soft¬ 
ware products and multicore evangelist 


at Intel, said this is not a hindrance for 
Phi. Using the MPI programming model 
to treat the Phi as if it were a Linux clus¬ 
ter makes this desktop HPC environ¬ 
ment familiar to HPC developers, who 
have been using MPI for some time. 

“I think of it as SMP [symmetric mul¬ 
tiprocessing] on a chip,” said Reinders. 
“You’ve got a collection of processors, a 


cache-based architecture with vector 
units, all brought together with extreme¬ 
ly high-performance interconnect. What 
we’ve done is put it on a single chip.” 

And as for the belief that Phi is a col¬ 
lection of Pentium Pros, Reinders put 
that to rest. “First of all, they’re not Pen¬ 
tium Pros,” he said. “We’ve said in the 
past that it’s essentially a Pentium core, 
but the problem with talking about it like 
that is that Pentium cores didn’t have 
vector units. We have very wide vector 
units: 16 floats wide. It also has four 
threads per core. It has 64-bit support, 
machine exception handling, and power 
states. The Pentium had none of those. 

“The only reason we ever mentioned 
it was sort of like a Pentium was to get 
people thinking about the in-order exe¬ 
cution. It’s not as high-performance per 
thread. But it’s a better solution to over¬ 
all power consumption.” 

These two companies are also 
elbowing each other over nVIDIA’s 
decision to branch out and build Ope- 
nACC, rather than build accelerator 


directives within the OpenMP stan¬ 
dards process. These directives can be 
used to tell an instruction where to 
process: on the SIMD GPU or the 
MIMD CPU. For nVIDIA, that’s what 
OpenACC attempts to create. But 
Reinders said the OpenMP team was 
already working on a CPU- and GPU- 
compatible solution to the problem. 


Said nVIDIA’s Buck, “I think Intel 
and nVIDIA agree that directives are a 
good way to get operations off a CPU 
and onto a GPU by expressing regions 
that can be loops and use parallelism. 
But OpenMP takes time. They’re a 
standard, they have many years of lega¬ 
cy work to do. Instead of waiting for 
that to happen, we’ve been pushing 
OpenACC as an alternative. It supports 
GPU, CPU, PGI, Cray.” 

But Reinders expressed frustration at 
nVIDIA’s decision to create OpenACC. 
“I do not think OpenMP is moving slow¬ 
ly. I think I would question anyone who 
would assert that,” he said. 

“Standards bodies should be very 
careful to standardize things that we 
can all survive with. OpenACC is a sub¬ 
set of what the OpenMP committee 
was working on, but it was a subset 
designed only to service GPUs.” 

However, Buck claimed nVIDIA has 
always planned on contributing the 
OpenACC standard and code back into 
OpenMP. I 
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UrbanCode handles 
release management 

New uRelease tool helps coordinate 
changes across multiple applications 

BY SUZANNE KATTAU 

Adding to its set of DevOps tools, UrbanCode announced last 
month the general availability of the uRelease enterprise 
release-management and coordination platform. 

uRelease allows DevOps teams to plan, execute and track 
releases all the way to production. There are three specific 
problems that uRelease addresses, according to the company. 
The first one is the coordination of the changes across multi¬ 
ple applications that need to go out together, which typically 
have dependencies on each other. 

The second problem is that, when rolling these changes 
into production, typically it’s not a ques¬ 
tion of just deploying software. Usually 
there are infrastructure changes that need 
to be made as part of a release as well. 

“Before today, there was no single system 
that kept track of both sets of changes,” 

UrbanCode CEO Maciej Zawadzki said. 

“It has the ability to keep track of the 
infrastructure changes as well as the appli¬ 
cation changes that are part of a single 
release.” 

Zawadzki said that the third problem 
addressed by uRelease is the management 
of interdependencies among applications. To deal with this, he 
said, environment-management capabilities have been added. 

For example, during QA testing, instances of these appli¬ 
cations must be tested in multiple environments. “Typically, 
our customers have many different QA environments. And 
different versions of the same application may exist in differ¬ 
ent QA environments,” Zawadzki said. “For some applica¬ 
tions, because they’re just big, heavy applications, they may 
not exist in every QA environment; they may only exist in one 
or two QA environments because they’re big, heavy, complex 
and expensive. So it’s not economical to have them in every 
QA environment. But yet, for the purposes of testing, we 
need to create a complete picture.” 

Creating this complete picture is difficult, according to 
Zawadzki. “This means that there’s this complexity of, 'Okay, my 
Application A is in QA environment 1. But in order for it to talk 
to Application B, I want to use the instance of Application B 
that is in QA environment 2. So, for the purposes of testing the 
release, I’m using the instance of Application A from QA envi¬ 
ronment 1, but it’s talking to the instance of Application B in QA 
environment 2.’ So we’re creating this logical QA environment 
for the release that spans, really, the physical environments QA 
1 and QA 2.” I 



DevOps teams can 
now track releases, 
says UrbanCode's 
Zawadzki. 
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Devart updates dotConnect ADO.NET providers 
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Developers creating models can set the target Entity Framework version in the Model Settings. 


BY SUZANNE KATTAU 

To simplify database-related applica¬ 
tion and website development, data¬ 
base connectivity solution provider 
Devart last month updated its family of 
dotConnect ADO.NET data providers. 
Updated versions of dotConnect are 
now available for MySQL 7.4, Oracle 
7.5, PostgreSQL 6.4, Salesforce 2.2 and 
SQLite 4.4. 

Devart, a partner in the Microsoft 
Visual Studio Industry Partner pro¬ 
gram, said its enhanced database con¬ 
nectivity solutions were built over the 
ADO.NET architecture and now sup¬ 
port Entity Framework 6 Alpha 2 and 
LinqConnect ORM solutions. 

In order to support Entity Frame¬ 
work 6 Alpha 2, the company said it 
included new assemblies compiled 
under Microsoft’s .NET Framework 4.0 
to the installation packages of its data 
providers. It said it rebranded assembly 
versions as version 6 to distinguish 
them from the corresponding Entity 
Framework v4 and v5 assemblies. 

With Devart’s enhanced data 
providers, developers can now use 
Entity Framework 6 Alpha 2 features 
such as custom code-first conventions, 
which let them configure settings such 


as default property mapping. The com¬ 
pany has also added support for enums, 
as well as code-based configuration. 
Developers will need to download the 
corresponding Entity Framework 
NuGet package from Devart s website 
in order to use it in their applications. 

Entity Developer, Devart s own ORM 
model designer, is supplied with the data 
providers. Entity Developer allows 
developers to create Entity Framework 6 
Alpha 2 models. When creating or edit¬ 
ing an Entity Framework 6 Alpha 2 mod¬ 


el, the company said, developers can set 
the target Entity Framework version in 
the Model Settings. Entity Developer 
then generates valid code that is compat¬ 
ible with Entity Framework 6 Alpha 2. 

The new versions of Devart s dotCon¬ 
nect ADO.NET providers also include 
an additional Devart.Data.Oracle.Enti¬ 
ty. SharpMap.dll assembly—compiled 
for .NET Framework 4.0—to support 
the SharpMap library. SharpMap is a 
geospatial-mapping library for use in 
Web and desktop applications. I 


In other component news... 

Document, content and imaging solution provider AccuSoft 
recently announced Barcode Xpress v9, an updated barcode SDK 
that allows developers to write QR Codes and read GS1 DataBar 
barcodes when developing applications. (The GS1 DataBar is a bar¬ 
code symbology originally designed to replace the UPC barcode 
format.) Barcode Xpress v9 supports all four QR Code modes: 
numeric, alphanumeric, binary and Kanji. 

Imaging tool provider Atalasoft recently announced Atalasoft 
10.3, its latest major SDK release. Developers can now use it to 
build mobile document-viewers with annotations. The Dotlmage 
Web Document Viewer controls are now supported on iOS 
Mobile Safari and Android's Chrome mobile browsers. This also 
includes annotation support with touch interfaces, zooming and 
pagination. There are PDF generation updates as well. In DotPdf, 


developers can import SVG drawings and use the new Table- 
Shape to add tabular data to a PDF. There is also a Java Barcode 
Reader add-on. Joltlmage, Atalasoft's Java SDK, now supports 
the same ID and 2D barcode sets that its .NET eguivalent has. 

Microsoft component solution provider ComponentOne (a divi¬ 
sion of GrapeCity) recently released Spread WinRT, for use in 
Microsoft Windows Store applications. Spread WinRT combines 
grid capabilities and spreadsheet functionality, along with an API. 
Spread WinRT data visualization support includes sparklines, con¬ 
ditional formatting, icon sets, data bars, color gradients, Excel- 
style grouping, and filtering with Excel-style dialog support. 

Data grid and data-compression component provider Xceed 
recently updated Xceed Extended WPF Toolkit Plus, the commer¬ 
cial version of its CodePlex project. Version 1.9 now features 
more than a dozen new 2D WPF layouts, which work with Switch- 
Panel, the product's core layout-to-layout animation control. I 
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But these best practices can help you defeat them 


F inding and fixing defects in soft¬ 
ware is as old as the practice of 
development itself. And, like 
development, defect tracking is done 
many different ways by different 
organizations, depending upon the size 
and distribution of the team, business 
priorities, and even the development 
process used. 

Many organizations will use an algo¬ 
rithm to help decide what defects get 
fixed, and when. It can range from 
“Don’t ever ship software with bugs” to 
a ranking system, and it will include 


BY DAVID RUBINSTEIN 

such calculations as the number of peo¬ 
ple impacted by the defect, how likely 
the defect is to come up, how much it 
costs when it does arise, and how much 
support the defect is generating. 

Methodologies such as agile devel¬ 
opment and continuous integration 
and deployment also have changed the 
game, but in ways that may seem coun¬ 
terintuitive. One might think going 
faster creates more defects and leaves 
less time to find and remediate them, 


but just the opposite is true. Agile 
development means focusing on small¬ 
er pieces of the project, so the amount 
of change is much smaller, and, accord¬ 
ing to Jonathan Nolen, a senior devel¬ 
opment manager at software tool 
provider Atlassian, “the chances for 
you to create unintended conse¬ 
quences are less.” 

In todays world, “the acceptable lev¬ 
el of quality you’d release has to be 
higher,” said Alex Perec, senior product 
manager at TechExcel, which makes 

continued on page 38 ► 
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‘If you ship a bug and you know it 
will be out there in the field for a 
minimum of two years, then the cost 
in terms of customers hit—dollars 
hit—the pain cost is very high . 1 

—Jonathan Nolen, Atlassian 


◄ continued from page 37 

development tools. “You want to 
address issues as quickly as possible.” 

This article will crystallize the 
thoughts and opinions of leading 
experts into a best-practices guide to 
defect tracking and remediation. 

BEST PRACTICE ttl 

DEFECT TRACKING IS NOT A 
STANDALONE TASK 

“Ifs really hard to talk about just defect 
tracking in isolation from the whole rest 
of your entire project setup,” Nolen 
said. “Its a very holistic activity, because 
anytime you choose to fix a bug, you’re 
essentially choosing not to do some¬ 
thing else. So you always have to bal¬ 
ance fixing that bug versus the other 
thing you would like to be doing. Tying 
that decision-making process into your 
entire development team and process is 
very important.” 

Defect tracking must be part of the 
overall development and planning 
process, according to AccuRev vice 
president of product management Brad 
Hart. “If you say you’re doing agile 
development and you’re planning your 
next sprint, there’s going to be stories 
that you accept into the sprint, and 
you’re going to use your agile planning 
tool to manage that, but you should also 
be planning the defects that you’re 
going to accept. It can’t be as one-offs 
or on the side,” he said. 

Assigning roles is a good way to keep 
moving forward on a project while also 
making sure the software’s defects are 
being corrected, the experts said. 

Paula Rome, director of product 
management at Seapine Software, rec¬ 


ommended that organizations identify 
one person responsible for doing defect 
“triage”—assessing if the defect needs 
to be addressed at all, and if so, when, 
and by whom. “What is your triage 
process? Who’s responsible? How often 
should they look at new issues to catego¬ 
rize, prioritize and assign?” she asked. 

Atlassian’s Nolen agreed with Rome, 
and said that at his company, that role is 
called “the disturbed.” This, he 
explained, “allows the majority of the 
team to be as undisturbed as possible. 
One guy looks at all incoming bugs, 
does triage, and fixes them himself.” In 
this way, people who file bugs get a 
quick response, and the whole team 
doesn’t have to be manning the bug 
queue, he added. 

BEST PRACTICE tt2 

DEFINE YOUR PROCESS, AND 
MAKE SURE IT'S UNDERSTOOD 

Creating a process for dealing with 
defect fixes offers efficiencies and can 
be tailored to how an organization likes 
to work. 

The process, though, “is more com¬ 
plicated than being a state traffic cop,” 
offered Rich Bianchi, CEO of software 
provider Alexsys. “All this is a workflow 


problem that might be multi-faceted.” 

Alexsys’ tools can be customized to 
enable users to create different forms, 
layouts and requirements for different 
types of defects, each of which might 
kick off a different workflow. “Depend¬ 
ing upon your product, you might have 
different selections; you can select dif¬ 
ferent people” to be in the workflow, he 
said, while targeting different types of 
bugs for different builds. 

Meanwhile, Axosoft QA engineer 
Michael Robinson said terminology can 
be an issue if the items in the software 
are not called the same as terms used 
among the working groups, which can 
lead to a lack of understanding. If 
you’re using Scrum as a development 
methodology, he said, you want the tool 
to call work items “user stories” and 
“product backlogs,” for example, 
instead of “defects” and “features.” 
Customizable fields are an important 
feature in defect-tracking tools, and 
most allow those changes to be made, 
according to experts. 

Seapine’s Rome cautioned, though, 
that it’s important “for everyone on the 
team to know that the custom field 
means. You need to take the time to 
agree on step 1: what a ‘bug’ means.” 
She went on to say that when organiza¬ 
tions have been doing something for a 
while, it’s easy to add custom fields to 
the tools. But after a process is a couple 
of years old, it’s important to make sure 
you’re adjusting it. “It’s not that the 
(defect-tracking) system isn’t working 
anymore, but the process has changed. 
Organizations think at the time the cus¬ 
tom changes are great, but they never 
revisit it, and you end up with a bloated 
system.” 

Atlassian’s Nolen discussed the 

continued on page 41 ► 


Come together, over bugs 

Some people use a spreadsheet or create a Word list. Some use a standalone defect¬ 
tracking tool. Others have a more holistic system for managing the process. But one 
thing all the experts agree with is that the defect-tracking solution must be integrated 
with code repositories and project-management software as part of the development 
life cycle to be most effective. 

Paula Rome, senior product manager for the TestTrack family of products at Seap¬ 
ine, said, “We buy into the notion that you'll get more out of your tools if they're tight¬ 
ly integrated. If you're using a spreadsheet just to list everything, why track them if 
you're not going to do anything with them? What is the workflow and process?" The 
tools, she said, have to take organizations from finding the bug to assigning it, fixing 
it, testing it and redeploying the software—seamlessly. —David Rubinstein 
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Change-based development 

A development process that facilitates the tracking of defects is 
what AccuRev's Brad Hart calls "change-based development." The 
idea is that you manage projects and code at the defect or change 
level. Hart explained: 

"People don't work on files for the sake of working on files. 

They're fixing defects or implementing stories, so we believe 
strongly in the benefits of change-based development. Developers Brad Hart 
are assigned a defect to fix, or a story to implement or an enhance¬ 
ment to add, they'll go onto the system, they'll get the source code, make their 
changes, and you want to be able to associate those changes with the particular defect 
that you're fixing. And from then on, you want to manage based on a defect level. 

"I want to test this defect fix in QA. I want to run a regression test against that 
defect fix. I want to run user-acceptance testing. I want to show it to a product owner, 
make sure it's working the way it's expected to, at the change level, right through the 
development process. So while the defect tracking, story planning and all that is sort 
of the front end of the dispatching of the work, it really has to be a closed-loop process, 
so developers are assigned a task, or a defect, story, whatever, they write the code, 
they fix the code, they submit it against that story/defect/task, and then that code is 
tested throughout the system at the issue level. You want a report between two releas¬ 
es, you want to know what defects were fixed. Those translate to files and versions of 
files and diffs of files, but at the end of the day, you want to work at that defect level." 

The benefit is that you gain isolation and traceability, Hart pointed out. "If you 
were assigned five defects or stories to fix in a particular iteration, ideally, if you're 
employing best practices and continuous integration, etc., and you're developing at 
the change-based level, I can theoretically take what's known good software, add my 
defect fix to it, have it tied to the issue-tracking system so it can see what versions of 
the files and all that were part of that fix, and then build a test of just that known good 
software plus my defect fix, and have that move through the hierarchy. 

"Everybody's got different levels of testing; from the time code leaves a develop¬ 
er's fingertips, it's at its least mature, and as it goes through the various gates in the 
process—there's builds that happen, there's integration testing, regression testing, 
performance testing, QA verifies the fix—everybody has those steps. Each step 
along the way, if you're working in a change-based development mode, that fix or 
story has to walk through that process, and ideally, if you're doing change-based 
development, it's isolated to just that fix. That way, if something breaks, you know 
exactly what broke. You know that that fix wasn't good. It wasn't, 'OK, we just 
slammed in a bunch of new versions of files and we did a whole bunch of changes,' 
and you're testing sort of a configuration of a new version of the software without 
the individual isolation." 

—David Rubinstein 



◄ continued from page 38 

notion of warranty bug fixes. “If your 
team built a feature, it is responsible for 
bugs, even if you’ve moved onto some¬ 
thing else. This gives other people in 
the company dealing with the feature 
someone to go talk to,” he explained. 
For non-warranty bugs, which usually 
involve projects more than 10 years old 
with large feature surfaces, defects go 
into a backlog, and a person or team 
then cranks through them as quickly as 
they can. “It’s important that somebody 
is trying to make some dent in that 
backlog,” he said. 

BEST PRACTICE tt3 

PRIORITIZE, AND USE YOUR 
TIME WISELY 

If everything is a Priority-1 defect, then 
nothing is a Priority-1 defect, said 
Alexsys’ Bianchi. “You have to effective¬ 
ly prioritize,” he said. 

The definition of an acceptable level 
of defects will vary between organiza¬ 
tions, or even from product to product, 
according to TechExcel senior product 
manager Vineet Agarwal. “Some organ¬ 
ization might decide that there can be 
no user-visible bugs in the product. 
Others might say only a Priority 2 or 3 
is allowed in a release.” 

These kinds of priorities help organ¬ 
izations get the most out of their engi¬ 
neering resources. 

Atlassian’s Nolen said his engineering 
teams only have two versions of the soft¬ 
ware in development at any given time: 
trunk and stable. “The way we do ver¬ 
sioning is, let’s say you’ve got project ver¬ 
sion 3.4.0. From that moment on, your 
3.4 branch, that is your stable branch. 
You’ll do .0, .1, .2, .3... While that’s going 
on, your trunk, which is 3.5, or 4.0, that’s 
in development internally and usually 
customers are not seeing that at this 
point, and so you split you efforts 
between bug fixes and maintenance on 
[the stable branch]. Usually, we spend 
like 15 or 20% of total engineering effort 
in that bucket, sometimes a little less. 
And then trunk—your 3.5 or 4.0—is on a 
three- to four-month delivery cycle, and 
that’s where the majority of your effort is 
going. What that means is, anything prior 
to your 3.4 version is essentially frozen. 


“We don’t do fixes, we don’t do special 
one-offs, we don’t do branches per cus¬ 
tomer, we don’t do custom anything,” he 
added. “The only time we ever go back 
further than that four months is in the 
case of critical security fixes. We will 
issue patches for those. But we don’t do 
any traditional bug fix or any mainte¬ 
nance. That basically means our engi¬ 
neering efforts are not split or divided 
across multiple current versions of the 
software, and we don’t have to do each fix 
six or 12 or 18 times as I’ve seen other 
companies sometimes do. It gets espe¬ 
cially bad when you end up branching 


with custom features per customer or 
per even slice of customer. It’s something 
I would highly encourage for people to 
draw a line in the sand and not do that.” 

Bianchi said he’s a big fan of doing 
high-level build plans to begin with: 
Lay out what you want to produce, and 
let the engineers work from that. 

“Make it as clear as possible,” he 
said. “People want to know one thing: 
What am I supposed to work on 
today?” But, you have to remain flexi¬ 
ble as well. “For every 20 planned 
tasks you have, you’ll have 50-100 oth- 
continued on page 42 ► 




42 , SPECIAL REPORT 


I SD Times | February 2013 | www.sdtimes.com | 


◄ continued from page 41 

er things—mostly bugs—and you have 
to track them,” he said. 

BEST PRACTICE tt4 

EMBRACE SPEED 

The trend to agile is a great thing, said 
AccuRev’s Hart. “I haven’t seen anyone 
doing pure agile by the book, but there 
is one thing in common: quicker feed¬ 
back,” he said. “If a bug is a story, it’s 
found right away, as opposed to finding 
it later during integration, like under 
waterfall. This greatly reduces the cost 
of bugs, and I’ve seen defects being 
treated more as first-class citizens.” 

Hart noted that development’s all 
about going faster and producing more 
code, while the operations side is about 
managing risk. “So,” he said, “you want 
to be able to allow the ops team the abil¬ 
ity to say, ‘You know what, we’re real 
close to pushing a new release to the 
site, but there’s a critical fix that I need, 
I want to be able to take just that fix, not 
last night’s build,’ because last night’s 
build might include 15 different things, 
and maybe three or four of those are 
kind of risky that they want to do some 
more verification on. If you’ve got 
everything properly set up and you’re 
working in a change-based model, and 
your systems support it, then the ops 
guy can say, 1 just want to take that crit¬ 
ical fix, just that fix, even though there 
are 10 other fixes in the queue,’ grab 
that change and push that into produc¬ 
tion, mitigating the risk. This will give 
you a lot more granularity as well as the 
traceability you’re looking for. 

Working quickly and reducing the 
cost of bugs allows engineering teams 
to work on tasks that advance the soft¬ 
ware, according to Atlassian’s Nolen. 
“When I think about our engineering 
teams, I think about the ratio between 
work that pushes us forward and work 


that is about removing risk. It’s really 
easy to let that ratio get skewed to 
where you’re spending 80 or 90% of 
your time removing risk and only 10% 
of your time actually building some¬ 
thing a customer cares about. So, the 
way I view it is, if you can find a way to 
reduce the cost of a bug, you can also 
reduce the amount of time you spend 
trying to avoid that bug, because you 
know that you can fix it very quickly.” 

The shorter cycles called for by agile 


development “address the immediate 
need, instead of compiling a list of high- 
and low-priority bugs,” said TechExcel’s 
Perec. This, he said, frees organizations 
from the multiple calls of “When will it 
be fixed? When will it be fixed?” 

Agile methods also get developers 
and QA working more closely, said 
Perec. “The development team is only 
focusing on one or two features. This 
takes pressure off QA, which only has to 
test the two new features and then run 
regression tests. Defects are discovered 
and fixed in the same cycle,” he said. 

Axosoft’s Robinson cited another 
benefit of agile development on defect 
tracking: “When you’re doing a lot of 
agile, you’ll be able to solve things so 
quickly that when you look at the 
(defect) list, those bugs won’t even 
apply anymore. They’ll be fixed before 
they can even be taken off the list.” 

Another critical element to reducing 
delivery time is feedback, according to 


Nolen. “You want the feedback from the 
customer to get back into your engineer¬ 
ing team as quickly as possible,” he said. 
“And you don’t want completed code sit¬ 
ting on the shelf essentially wasting val¬ 
ue or rotting while you try to organize a 
release. The faster you can get it out to 
the customer, the faster you can learn, 
the faster you can deal with problems or 
learn good things, like if you find a direc¬ 
tion that has been profitable, continue to 
pursue that. That’s why we think that 


reducing the number of hours between 
somebody committing a change and a 
customer actually seeing it and the 
developer getting the feedback from 
that interaction is of primary importance 
in a healthy engineering team.” 

Nolen went on to say that when you 
deploy faster, you reduce the cost of the 
bug that you shipped. “If you ship a bug 
and you know it will be out there in the 
field for a minimum of two years, then 
the cost in terms of customers hit—dol¬ 
lars affected—the pain cost is very 
high,” he pointed out. “If you ship that 
same bug, and you know that as soon as 
the first customer hits it you will notice 
it, fix it and that no one else will touch 
it, your cost is very low. Which is one of 
the reasons that Software-as-a-Service 
is such a great sort of paradigm shift for 
our industry because it means once 
you’ve seen the bug one time or two 
times or five times or however many 
times it takes to notice, you fix it and no 
one will ever see it again.” 

BEST PRACTICE tt5 

COMMUNICATE 

“This might sound trite,” said Alexsys’ 
Bianchi, “but it comes down to commu¬ 
nication. The dev team is not thinking 
about what marketing needs. It’s impor¬ 
tant to make everyone’s voices heard.” I 
□ Find this story at http://sdt.bz/37336 


Best practices for defect tracking 

As gleaned from the experts: 

1) Defect tracking is not a standalone task 

2) Define your process, and make sure it's understood 

3) Prioritize, and use your time wisely 

4) Embrace speed 

5) Communicate 




'It comes down to communication. 

The dev team is not thinking about 
what marketing needs. It's important 
to make everyone's voices heard.' 

—Rich Bianchi , Alexsys 
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IT and developers: 

Hand in hand for MDM 


Mutual understanding enhances 
mutual work for mobile apps 


BY SUZANNE KATTAU 

W hen it comes to mobile 
device management, there’s 
a fundamental mindset shift 
that’s required in today’s organizations 
between IT and application developers. 
Instead of being at odds, IT and devel¬ 
opers should collaborate with one 
another. According to businessmen, 
IT’s traditional goal of controlling the 
developer should be replaced by the 
willingness to do six things to help the 
application developer do his or her job 
better. They also said there are also five 
ways in which developers can help IT to 
more effectively do its job. 

Through this partnership, IT and 
developers can better manage their 
organization’s mobile devices and appli¬ 
cations throughout their life cycle. “If 
IT can create value for the app develop¬ 
er, then they’re going to have a partner¬ 
ship where they can work together,” 
said Ojas Rege, vice president of strate¬ 
gy at Mobilelron, a mobile device-man¬ 
agement solution provider. 

“If IT can’t do that, then the line of 
business is just going to go right around 
IT and you’re going to have hundreds 
of orphaned, unsecured applications 
out there in the next 12 months in every 
major organization.” 

Because of the importance of this 
collaboration, mobile device manage¬ 
ment has grown much beyond the fact 
that IT can, for example, remotely wipe 
a device. “The MDM concept is much 
broader today,” said Yoram Mizrachi, 
CTO at Perfecto Mobile, a provider of 
cloud-based mobile-application testing 
solutions. “Now IT has to support hun¬ 


dreds of developers and testers in the 
organization. And developers need, 
basically, an IT service of managing 
those mobile devices.” 

There are six core things that IT can 
do to provide value to the enterprise 
app developer—and thus the organiza¬ 
tion—and it begins with developer 
sourcing, according to experts. IT has to 


help organizations find capable devel¬ 
opers. “What developers are you going 
to use? It’s not easy to find great mobile 
developers,” Rege said. “Which ven¬ 
dors do you use? Do you contract? This 
is the whole notion of, ‘Let me help you 
to get the resources you need to build a 
great app.’ ” 

continued on page 46 ► 
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The second fundamental thing that 
IT can do, Rege said, is to provide devel¬ 
opers with access and support for enter¬ 
prise APIs because all of these apps are 
going to rely on enterprise data. Having 
really well-designed APIs to build apps 
on top of is going to make app develop¬ 
ment cheaper, faster and more effective. 
“That’s absolutely a role that IT can play, 
which is, 'Hey, I’ll give you the APIs, I’ll 
give you guidance around those and I’ll 
educate you on them. I’ll tell you how to 


use them effectively and I’ll make new 
ones available when you need them,’ and 
so forth,” he said. 

The third key thing involves empha¬ 
sizing the importance of the user expe¬ 
rience and giving developers design 
best practices. This is important 
because, with mobile apps in particular, 
user experience is absolutely the litmus 
test to adoption, according to Rege. He 
said if a company screws up the user 
experience, it has just wasted money. 

“All of us are very used to incredible 


consumer apps,” he said. “Consumer 
apps have set the bar for user experi¬ 
ence. So, if I try to cram a big enter¬ 
prise app onto that little mobile screen 
and give you 15 pull-downs, you’re just 
not going to use it.” 

The fourth thing that IT can do to 
help developers is to help them with 
security. It’s probably the one that’s most 
important for IT to do. Rut in some 
ways, Rege said, helping with security 
has traditionally been the hardest for 
them. “This is what I call 'plug-and-play 


Mobile application management: 
A subset of MDM 


BY SUZANNE KATTAU 

If mobile device management is prima¬ 
rily a function of IT, then developers 
play a critical role in what is coming to 
be known as mobile application man¬ 
agement. 

“We consider mobile application 
management and mobile device man¬ 
agement to be very separate but, obvi¬ 
ously, highly related and intertwined 
disciplines,” said Alan Murray, senior 
vice president of products at Apperian, 
a mobile application-management solu¬ 
tion provider. “It’s really a matter of 
where the focus of innovation happens 
and the focus of granularity and that 
sort of thing.” 

Particularly when you talk to ven¬ 
dors, Murray said, you’ll hear the 
phrase repeated again and again: 
They’re all about getting the right apps 
to the right people and the right device. 
He said you hear this phrase in the 
industry so much that it becomes trite, 
but it’s actually true. 

“That is a little bit of what all of us 
are trying to do,” he said. “What I’d like 
to do is break that phrase down and say, 
when you’re looking at the trinity, if you 
will, of mobility, then it’s the right apps, 
devices and people.” 

And “apps” is really synonymous for 
apps and data, said Murray. “Each one of 
those things is an independent system of 
record,” he said. “Most people wouldn’t 
argue that in any enterprise of scale, you 



have a system of record 
for identity because iden¬ 
tity requires a life cycle: 

How do you onboard 
users, offboard users 
change them into 
roles and move 
them into different 
groups and organi¬ 
zations? As an enterprise 
becomes more sophisticated, you 
really need that for devices and for appli¬ 
cations as well because they have com¬ 
pletely independent life cycles.” 

“MAM is more of a subset of an 
overall MDM solution that an enter¬ 
prise would need,” said Carl Rodrigues, 
president and CEO of SOTI, a mobile 
device-management solution provider. 

“There are some people whose twist 
on it is that they just manage the appli¬ 
cation and they have some tools to man¬ 
age that application—they call it MAM, 
or mobile application management,” 
added Simon Vivera, product market¬ 
ing manager at SOTI. “But there’s a 
blur happening between these two 
terms. We say we’re MDM, but we do a 
lot of stuff to manage the individual 
applications as well.” 

Mobile application management for 
individual applications is important, 
but no company only cares about 
MAM, Vivera said. He says companies 
need to worry about both MDM and 
MAM. “If you just worry about one 


aspect, then you’re protecting 
that one app, maybe, but 
there are many other 
things,” he said. 

“For example, that 
device has to get on the 
corporate network, which 
has nothing to do with the 
app—that’s at the device level. 
So you need to worry about, 
when the corporation deploys a 
device and the device is lost or stolen, 
you want to know where that device is, 
you want to take action to delete all the 
data or to put up a message to recover 
the device.” 

And so MAM is all about managing 
the life cycle of mobile applications, 
while MDM is really about managing 
the life cycle of mobile devices. Murray 
said there are elements where they 
overlap but still remain discrete. If 
you’re a small enough organization and 
don’t have particularly sophisticated 
needs, he said you could probably just 
use the same all-in-one solution for 
apps, devices and people. 

“But as you start to grow in scale, 
you’ll realize that you require a specialist 
capability in each of these disciplines,” 
Murray said. “Some tool may be able to 
provide that for you in the future, but 
today, just looking at maturity, you see 
companies as either specializing in 
device management or specializing in 
application life-cycle management—and 
nobody really doing both equally well 
yet. Undoubtedly, that will change as the 
tools mature.” I 
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' It's important for the developer to 
understand the company's business model., 
Their role is to ensure that the application 
experience meets the 
requirements of the IT admin 
who is new to mobility . 1 

— Troy Fulton, Tangoe 



security/ IT needs to give the developer 
something that’s easy for the developer 
and invisible to the user,” he said. 

The fifth thing that IT can do is pro¬ 
vide developers with a fantastic enter¬ 
prise app store. When developers build 
apps, they need to figure out a way to get 
it to the employees. “The way to get it to 
the employees is through a private app 
store that’s run by that enterprise,” Rege 
said. “If IT can provide a consumer- 
grade discovery experience through an 
enterprise app store, that’s a big value to 
the developer because the developer 
will say, 'Hey IT, I really need to get this 
app distributed to all the people in mar¬ 
keting. Can you help me do that?’ And 
IT will say, 'Absolutely. I’ll pop it on my 
internal app store and will make it avail¬ 
able just to the marketing team.’ ” 

The sixth and final thing is IT provid¬ 
ing the developer with tools to drive and 
measure adoption. “So, IT can actually 
tell a developer, 'Hey, you know those 
three apps you built? This one’s getting 
a lot of use, but this other one’s not real¬ 
ly getting a lot of use,’ ” Rege said. With 
these tools, IT and developers can fig¬ 
ure out why some apps are more popu¬ 
lar than others, then find and fix the 
problems with the unpopular ones. 

And in return... 

So IT can help developers, but for the 
collaboration to work, developers must 
also help IT with their job. In speaking 
with experts, five things for developers 
to do have become clear. “First off, it’s 
important for the developer to under¬ 
stand the company’s business model 
because there are many different 
paths,” said Troy Fulton, director of 
product marketing at Tangoe, a mobile 
device-management solution provider. 
“Their role is to ensure that the applica¬ 
tion experience meets the require¬ 


ments of the IT admin manager who is 
relatively new to mobility and managing 
thousands of devices.” 

Fulton said developers need to make 
sure that what they are doing can scale 
across platforms so that there is consis¬ 
tent policy enforcement as well as flex¬ 
ibility to accommodate business-unit 
operations, different locations, and dif¬ 
ferent roles. Developers need to under¬ 
stand how they all contribute to the 
company’s business model, he said. 

Whether they’re internal or external, 
multi-platform mobile applications are 
starting to be very tightly associated with 
enterprise goals, experts said. So, a sec¬ 
ond way that developers can help IT is 
through learning how to do multi-plat- 
form applications. Not all developers are 
good at this. “It’s going to be very impor¬ 
tant for the app developer to really have 
expertise—either individually or in their 
organization—on multiple platforms 
because they’re so different,” Rege said. 

There are ways that people interact 
with iPhone apps that are different 
from Android apps because the user 
expectations are different, he said. “You 
don’t want an app that feels like 
Android on an iPhone or vice versa 
because the user is not going to like it,” 
he said. “There’s a reason they picked 
that operating system, because they like 
the way you navigate screens or the way 
you flip up and down.” 

Understanding the notion of a true 


native experience—what an iOS app 
feels like or what an Android app feels 
like—is very important. “Then being 
able to integrate that into the apps they 
build for the enterprise, that’s a big deal. 
That has real value to it,” Rege said. “I 
think that that’s an absolute responsibili¬ 
ty of an app developer, to understand the 
differences between systems, not just 
from a technical perspective but also 
from a usability perspective.” 

Recause it’s important for a company 
to have cross-platform apps, experts said 
the third thing that developers can do to 
strengthen their partnership with IT is 
make sure they get a clear definition of 
what’s expected for their mobile appli¬ 
cations on each platform. “IT needs to 
decide if it’s okay that a mobile applica¬ 
tion will run only on the iPhone 5 or on 
previous iOS versions or on Android as 
well,” Mizrachi said. “If a certain plat¬ 
form is not supported, IT has to know 
the impact of that on the organization.” 

The fourth thing that developers 
should do, Rege said, is to absolutely go 
to the table and say, “Yes, I am going to 
spend time and think about data securi¬ 
ty.” If IT is going to give developers a 
big novel to read about security prac¬ 
tices, chances are they won’t follow it. 
However, that doesn’t mean app devel¬ 
opers shouldn’t put security high on 
their priority list. 

The fifth thing that app developers 
can do is take a consumer-like view to 
the user experience, because user expe¬ 
rience is fundamental. “That means 
that if you’re a traditional enterprise 
app developer within an organization, 
you may have to relearn your tools of 
the trade,” Rege said. “Some enterprise 
app developers may have a hard time 
moving to that world, and there may be 
some retraining needed.” I 

□ Find this story at http://sdt.bz/37313 



‘IT needs to give the developer 
something that's easy for the 
developer and invisible to the user . 1 
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The following is a chapter from “Managing the Unmanageable,” 
a guidebook for getting the most out of your teammates. 


Motivating 

Programmers 

BY MICKEY W. MANTLE AND RON LICHTY 



B eing able to actually motivate programmers to accomplish 
great feats and deliver difficult projects is so important to 
managing effectively that this chapter is devoted to this sin¬ 
gle topic. You can be good at this and passable at most of the rest of 
the topics in this book and still be a great software manager. If, how¬ 
ever, you are not good at motivating your staff, you will not likely 
succeed as a software manager. 


Motivational Theories 

Several motivational theories emerged 
during the twentieth century that have 
helped shape the thinking of businesses 
regarding motivation. Though a bit the¬ 
oretical for this book, it is important to 
be aware of these theories since they 
periodically but inevitably come up 
during management discussions. Hav¬ 
ing a basic knowledge of these theories 
will help you speak more authoritatively 
regarding your motivational efforts and 
gain management support for them. 

Though the literature is now rife 
with motivational theories, there are 
three primary ones we suggest you 
become familiar with: 

• Maslows Hierarchy of Needs 

• McGregors X-Y Theory 

• Herzbergs Motivation and 
Hygiene Factors 

Of these three, we adapted Herz¬ 
bergs Motivation and Hygiene Factors 
to be more directly applicable to pro¬ 



grammers. Here is a brief review of 
Herzbergs work before discussing 
our adaptation of it. 

Herzberg’s Motivation 
and Hygiene Factors 

Frederick Herzberg, clini¬ 
cal psychologist and pio¬ 
neer of “job enrichment,” is 
regarded as one of the great 
original thinkers in manage¬ 
ment and motivational theory. 

His book “The Motivation to 
Work,” written in 1959, first 
established his theories about 
motivation in the workplace. 
Herzberg expanded his motiva¬ 
tion-hygiene theory in his subse¬ 
quent books, and the absence of any 
serious challenge to Herzbergs theory 
continues to effectively validate it. 

Herzberg was the first to show that 
satisfaction and dissatisfaction at work 


continued on page 50 ► 
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Figure 7.3: Herzberg's Motivators and Hygiene Factors. 


◄ continued from page 49 

nearly always arise from different factors 
and are not simply opposing reactions to 
the same factors, as had always previous¬ 
ly been believed (and still is by the unen¬ 
lightened). Figure 7.3 illustrates how 
Herzbergs research showed that one set 
of factors truly motivate (“Motivators”), 


Motivator? 



Figure 7.4: Herzberg's Motivators. 


whereas its a different set (“Hygiene 
Factors”) that lead to dissatisfaction. 

Herzberg’s research showed that peo¬ 
ple will strive to achieve “hygiene” needs 
because they are unhappy without them, 
but once these are satisfied, the effect 
soon wears off—satisfaction is tempo¬ 
rary. Poorly managed organizations fail to 
understand that people are not motivat¬ 
ed by addressing “hygiene” needs. Peo¬ 
ple are only truly motivated when they 
are enabled to reach for and satisfy the 
factors that Herzberg identified as real 
motivators—achievement, recognition, 
the work itself, responsibility, and 
advancement—which deliver a far deep¬ 
er level of meaning and fulfillment. 

It is important to note that in 
Herzberg’s study, salary ranks, as a 
motivator, a distant sixth and, as a cause 
for dissatisfaction, a distant fourth—it 
barely outranks “interpersonal relation¬ 
ship with supervisor” as a cause for dis¬ 
satisfaction. 

Another way of looking at this is 
shown in Figure 7.4. This representa¬ 
tion shows that you must address the 
basics (“those factors that when lacking 
cause dissatisfaction and impair your 


ability to motivate”) to create a strong 
motivational foundation, but it’s the 
motivators that provide the elevators to 
lift your team to the top. 

The foundational factors are the pil¬ 
ings on which the motivators rest; when 
they are damaged or undermined, 
there is no stable foundation, but in 
themselves they do not motivate. 

Herzberg’s pioneering work paved 
the way to better understanding worker 
motivation. The theory remains rele¬ 
vant and provides a solid foundation for 
motivating your staff. 

Motivational Factors as Applied to 
Programmers 

To try to bring a more software-centric 
view to Herzberg’s factors, we have 
modified his categories and rated them 
as our experience dictates. These 
adapted Herzberg factors are shown in 
Figure 7.5 (see page 53). 

We believe these motivational fac¬ 
tors, while not determined by means of 
formalized research and polling, are 
more representative of the motivational 
factors that actually matter based upon 
our many years of managing and work- 
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ing with programmers. 

We were also intrigued when we 
noticed, after debating the rankings and 
arriving at the final set shown in Figure 
7.5, that these top motivational factors 
largely correspond to the factors that 
make up most successful startup envi¬ 
ronments. 

The stark contrast between factors 
that motivate and those that demotivate 
was what struck Ron with his own “aha” 
moment when he first read Herzbergs 
article in the Harvard Business Review. 
“Whenever I had sat in management 
meetings up until that time to talk 
about motivating our teams, the conver¬ 
sation had always been one-dimension¬ 
al. The realization that we needed to 
focus first on ensuring that our people 
were satisfied with a basic set of factors 
in order to be able to motivate them 
with a different set was a true epiphany. 
It changed the conversation.” 

Common management practice is to 
throw money at the motivational prob¬ 
lem by posting performance bonuses 
tied to project completion. But our 
experience matches Herzbergs evi¬ 
dence that moneys not a motivator and 
risks being a demotivator. That conclu¬ 
sion was substantiated by Daniel Pink in 
his popular book “Drive: The Surprising 
Truth about What Motivates Us.” 

Pink cites repeated studies that show 
that, in fact, contingent rewards—for 
example, the promise of bonuses for 
finishing faster—for the most part 
cause projects to take longer than they 
would have without the promise of a 
reward. The studies show that rewards 
narrow focus, which is just fine for 
motivating manual piece work, but are 
counterproductive for creative work— 
the essence of our programming craft. 
Furthermore, rewards can become 
addictive (“I won’t give extra effort 



unless you offer me a bonus”) and can 
foster counterproductive short-term 
thinking (“Who cares how many bugs— 
I’m going for the bonus!”). 

“When people use rewards to moti¬ 
vate,” Pink says, “that’s when they’re 
most demotivating.” 

He does not propose dropping 
salaries. On the contrary, he says, “One 
reason fair and adequate pay is so 
essential is that it takes the issue of 
money off the table so that [people] can 
focus on the work itself.” 

Pink concludes what we have seen 
many times over the years: “Intrinsically 
motivated people usually achieve more 
than their reward-seeking counterparts.” 

Putting Theory into Practice 

Understanding that job satisfaction and 
dissatisfaction come from different fac¬ 
tors are important messages that set up 
the rest of this chapter. Putting theory 
into practice is always where the rubber 
hits the road. 

In theory and practice , 
practice wins out. 

—COMMON WISDOM 

The rest of this chapter discusses the 
tactics we employ to address those caus¬ 
es of satisfaction and dissatisfaction that 
we, as programming managers, can 
impact. We start with the basic founda¬ 
tional factors that you need to get right 
before you can even begin to motivate 
your staff. These match closely to the top 
seven causes of dissatisfaction (when 
lacking) that we have adapted to pro¬ 
grammers from Hertzberg’s work: 

• Respect for supervisor 

• Having fun 

• Learning and growing 

• Good working conditions 

• Sane company policies and admin¬ 
istration 

• Ethical management 

• Fair compensation 

We then continue with the most 
important motivating factors, which 
also match closely to the top six motiva¬ 
tional factors that we adapted to pro¬ 
grammers from Hertzberg’s work: 

• Making a difference in the world 

• Learning and growing 

• Toys and technology 


• Recognition and praise 

• Having fun with your staff 

• Upside 

Use all of these to help get you and 
your team working to peak productivity 
and motivated to make outstanding 
contributions. 

Foundational Factors-Causes of 
Dissatisfaction (When Lacking) 

Work hard to make sure you provide all 
of the foundational factors, or you will 
find your staff dissatisfied. While simi¬ 
lar to motivational factors (in fact, some 
are the same, with different emphasis), 
foundational factors are essential to 
provide before you can begin to moti¬ 
vate your staff. As shown in Figure 7.4, 
the foundational factors are just that— 
they’re the foundation you need to 
make sure is in place. 

Respected as Supervisor 

You earn the respect of your staff as 
their supervisor in many ways. The 
techniques that follow are ones that we 
have seen work for ourselves and others. 

• Gain technical respect 

• Respect others 

• Establish your culture 

• Lead by example 

• Help solve technical problems 

• Manage and coach 

Manager: a person who controls 
and manipulates resources and expen¬ 
ditures 

Coach: a person who gives instruc¬ 
tion or advice to others 

There s a trite , hut true , saying in 
coaching: A friend loves you just the 
way you are. A coach loves you too 
much to let you stay that way. 

-LYSSA ADKINS 


• Focus on your people 


If you’re a people manager .; your peo¬ 
ple are far more important than any¬ 
thing else you’re working on. 

-TIM SWIHART, Engineering Director, 
Apple Computer 


Your job as a manager is to never 
overlook your people and always make 
them the first priority. 

continued on page 52 ► 
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Figure 7.5: Herzberg's factors as we believe they apply to programmers. 


◄ continued from page 51 

Having Fun 

It is important to allow play into your 
workplace if you expect your staff to 
work hard. If you think of play as 
“unprofessional,” rethink. 

Play helps make the hard work worth¬ 
while and possible. You cannot expect 
people to work hard all the time. Provid¬ 
ing escape valves will pay big dividends. 
The key is to find ways to encourage your 
staff to work hard and play hard. 

Learning and Growing 

Because programmers are not primarily 
motivated by compensation (or “not 
coin-operated,” as we have heard it 
said), providing an atmosphere of learn¬ 
ing and opportunities to grow is exceed¬ 
ingly important. Strive to provide 
access to technical training (on- or off¬ 
site), presentations about new technical 
developments, new development tech¬ 
nologies, and other learning opportuni¬ 
ties. If you are budget challenged and 
cannot provide formal training to all of 
your staff, you should still bring in ven¬ 
dors and peers to present new products 


or industry developments. 

An oft-repeated refrain is that 
“nobody ever jumped ship who wasn’t 
already standing by the rail.” Helping 
your staff to keep learning and growing 
will move them farther away from the 
rail. 

Good Working Conditions 

If your staff doesn’t like being at work, 
they will likely find reasons and excuses 
for not being there! 

• Make the workplace a good place 
to work 

• Be mindful of the “no jerks” rule 

• Be flexible 

• Feed your team 

The secret of software development 
has always been good people 
and lots of Chinese food. 

—JEFF KENTON, Consulting Developer and 
Development Manager, Boston Area12 


Sane Company Policies and 
Administration 

Unless you start a company and have 
full control over its policies and com¬ 
munications, you will never have an 


opportunity to ensure that all the poli¬ 
cies and members of the administration 
are sane. However, here are things you 
can do to make the most of the policies 
and administration you do have. 

• Communicate 

Communicate repeatedly—the biggest 
culprit across organizations is 
lack of communication. 

-DAVID DIBBLE, EVP, Schwab Technology 

• Protect your staff from organiza¬ 
tional distraction 

Be a damper to the noise. 

—JOE KLEINSCHMIDT, CTO and Cofounder, 
Leverage Software 

• Protect your staff from bad organ¬ 
ization communication and poli¬ 
cies 

Ethical Management 

If you expect great things from your 
staff, you must set an example. One of 
the most important examples to set is 
that of a professional and ethical per¬ 
son. A manager who is ethical is forth¬ 
right and honest in dealing with his staff 
continued on page 55 ► 
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and others. He can be trusted to keep 
confidences and stand up for his staff 
against anyone in the company A pro¬ 
fessional shows up on time to work and 
to meetings without being reminded. A 
professional does not use abusive or 
profane language and projects an aura 
of being in control of a situation with¬ 
out undue attention. A professional has 
high expectations of himself and others 
and demonstrates an aura of cool and 
calm under fire. A professional tells the 
truth and does not try to manipulate 
facts or people. 

If you are a manager who is ethical, 
fair, and not self-promoting, you will be 
a long way down the road to having your 
staff think you are a great manager. 

• Be ethical and professional at all times 

• Be fair 

• Compensate fairly 

• Promote appropriately 

Key Motivating Factors 

If you provide the foundational factors, 
you will have provided the foundation. 
Now you can more readily motivate 
your staff by applying the motivating 
factors. This does not mean that your 
staff cannot be motivated without first 
shoring up the foundational factors, but 
your job will be easier if there are no 
causes of dissatisfaction to dilute their 
motivation. 

There are no “silver bullets” for cre¬ 
ating a motivated staff, but those moti¬ 
vating factors listed below are a good 
place to begin. As shown in Figure 7.4, 
once you’ve got the foundational factors 
in place, the motivational factors lift the 
team to top performance. 

Making a Difference in the World 

It is the rare individual who does not 
want to make a difference in the world. 
Many if not most of us chose to pro¬ 
gram at least in part for the opportunity 
to positively impact the world we live 
in. If your organization is building or 
providing something that can be 
pitched as making a contribution— 
improving the world in some way—you 
will have an easier time recruiting and 
motivating your staff. People are always 
willing to work harder and longer when 
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they feel their efforts matter. 

Never doubt that a small group of 
thoughtful, committed citizens can 
change the world. Indeed, it is the only 
thing that ever has. 

-MARGARET MEAD, U.S. Anthropologist 
and Popularizer of Anthropology 


Steve Jobs was the master of high- 
tech evangelism. His famous challenge 
that recruited John Scully out of Pepsi 
to run Apple in its early years was “Do 
you want to sell sugar water or change 
the world?” Steves Apple employees 
have been fanatical about being part of 
Apple’s mission to change the world. 

But it doesn’t take a Steve Jobs to 
pitch this kind of vision (though being 
an “insanely great” communicator with 
a “reality distortion field” certainly 
doesn’t hurt). Look for and find how 
your company is helping to change the 
world and point it out in your recruiting 
pitches, in your regular communica¬ 
tions to your staff, and especially when 
you’re calling upon people to dig deep¬ 
er and work harder for some milestone. 
It will make a difference! 

Learning and Growing 

Few can be successful in the software 
industry without continually learning and 
growing professionally. Technology 
changes so rapidly that new college grads 
would likely be obsolete within only a 
few years if they didn’t continue to learn. 

Toys and Technology 

One of the simplest ways to reward and 
motivate programmers is to equip them 
with a top-of-the-line development 
machine and environ¬ 
ment. Though this was 
even more true in the 
past, it remains true today 
primarily because of this 
rule of thumb: 

Software gets slower faster 
than hardware gets faster. 

-NIKLAUS WIRTH, 

Inventor of Pascal 


Finding ways to use 
toys to reward individuals 
or teams will win you 


points with your staff and help motivate 
them. This is one of the simplest “win- 
win” things you can do! 

Recognition and Praise 

An important part of your role as a 
manager is to make sure that the appro¬ 
priate people receive recognition and 
praise for exemplary work. 

If anything goes bad, I did it. 

If anything goes semi-good, 
we did it. If anything goes real good, 
you did it. That’s all it takes to 
get people to win football games. 
-PAUL "BEAR" BRYANT, Longtime 
University of Alabama Head Football Coach 


Everyone wants to be praised. From 
the time we are born, we learn that 
smiles and praise go hand in hand and 
make us smile and feel good in turn. 
Praise from someone whom you respect 
and work for is tremendously rewarding 
and makes you want to exceed their 
expectations and gain additional praise. 

Likewise with recognition directed 
to individuals, groups, and teams. 
Recognition can be for a task complet¬ 
ed, for a project completed, for a job 
well done, for a milestone achieved, for 
anything that is worthy of recognition. 
It can be delivered verbally, in e-mail, 
in video, in a meeting, in the hallway, 
over lunch, on the way out to the car on 
the way home, and in fact anywhere. 
Never pass up an opportunity to say, 
“Thank you and a job well done!” 

Having Fun with Your Staff 

Previously we listed “having fun” as one 
of the foundational factors for a success¬ 
ful programming man¬ 
ager. It’s also a motivat¬ 
ing factor because 
there is another dimen¬ 
sion to having fun that, if 
you can achieve it, will 
make your job as a man¬ 
ager much easier. That is 
the ability to loosen up 
and enjoy working with 
your staff. 

You don’t have to 
have fun every day, but 
if you can’t sit down 

continued on page 57 ► 
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and laugh with individuals or members 
of your staff about events of the day, or 
the random things that happen, or just a 
chance joke, you and they are likely not 
having fun! 

Celebrate your success and find humor 
in your failures. Don’t take yourself so 
seriously. Loosen up and everyone 
around you will loosen up. Have fun 
and always show enthusiasm. 
—SAM WALTON, Founder of Walmart 


Upside 

Upside has always been an important 
ingredient in personal performance. 
From ancient times, individuals have 
been driven to be successful by the 
promise of multifold return on their 
investment of hard work and results. In 
the high-tech software world this has 
traditionally been in the form of 

• Higher salary 

• Significant bonus 

• Stock options (or equivalent) 

• Strong company growth 

• Increased perks 

Use any form of upside as a tool; rel¬ 
ish it and use it creatively as best you can! 

Personal Commitment 

Though this is not one of the foundation¬ 
al factors or key motivators, we have 
found that for motivating staff, the ability 
to seek out and obtain personal commit¬ 
ments for key deliverables is an impor¬ 
tant ingredient that defines an outstand¬ 
ing software manager. A key to obtaining 
personal commitments at critical times is 
casual engagement all the time. 

When important projects or mile¬ 
stones are looming, the best way to 
ensure peak performance by your staff 
is to seek a personal commitment from 
those key contributors who are most 
capable of producing success. In a pri¬ 
vate setting (or as private as possible, 
without dragging them off to your 
office or a conference room), approach 
them and discuss the looming project 
or milestone. Discuss with them their 
part of the project. After it is clear to 
both of you what their contribution will 
be, reiterate why the project or mile¬ 
stone is important and ask them if they 
will “move heaven and earth” to make 
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sure the milestone or project is com¬ 
pleted on time or successfully. 

Most technical people are not moti¬ 
vated by fear or rewards, but rather by 
personal pride and wanting to perform 
well for their peers or manager(s). By 
securing a promise as a personal com¬ 
mitment, you will find that people work 
harder than you might 
expect from almost any 
reward you could offer. 

Make sure you acknowledge 
their efforts and spend at least 
as much time afterward dis¬ 
cussing the projects or milestones 
completion as you did when origi¬ 
nally soliciting the commitment. 

Keep in mind that you cannot 
seek this kind of commitment too 
often or it will lose its motivational 
effect. Save it for truly important proj¬ 
ects or milestones. 

Technology Offense and Defense 

You and your staff are continually inun¬ 
dated with announcements of new 
technologies. Some of them are obvi¬ 
ously important, and you must play 
good offense and investigate, antici¬ 
pate, and plan for their adoption. One 
of the biggest motivators for most pro¬ 
grammers is the chance to investigate 
innovation and learn new things. 
Rewarding staff who are doing excel¬ 
lent work by sending them to a techni¬ 
cal conference to report on a new area 
of technology, or assigning them to 
investigate a new release of software or 
survey a breakthrough that you need to 
consider adopting are great motivators. 
They will use these brief breaks from 
their usual tasks to recharge themselves 
and feel fulfilled. You must ensure that 
your key contributors have time in their 
schedules so that you can give them 
these kinds of rewards. Consider this an 
important “offensive play” in your man¬ 
agement playbook. 

However, this sometimes requires a 
judgment call on your part. All too often 
many of the new technologies inundat¬ 
ing you and your staff are half-baked, 
unimportant, and will never see com¬ 
mercial success. Any time spent investi¬ 
gating these is time wasted and lost! For 
such technologies, play strong defense 




by ensuring that your staff is not seduced 
by the allure of new “shiny things.” 

Understanding Your Programmers’ 
Motivations Begins on Day One 

As Herzbergs research showed, there 
are many factors that contribute to job 
satisfaction and dissatisfaction. But how 
these factors rank and how 
important they are to anyone 
specifically will vary. More 
than most, programmers are 
individualists, so it only stands to 
reason that what motivates each 
programmer will vary from person 
to person. One of the first things you 
should do is try to understand what 
motivates each one of them. 

The mental model we build 
for each person is mostly a prioritiza¬ 
tion of the topics in this chapter. By 
mentally ranking these items for each 
individual, we can begin to make sure 
that those most important to each indi¬ 
vidual are reinforced when they per¬ 
form in an excellent or outstanding 
fashion. We think that all these are 
important, but when you have the 
opportunity to reward outstanding per¬ 
formance, make sure to address those 
that will have the most impact on the 
individual. 

Summary 

There is no magic silver bullet that will 
allow you to motivate your program¬ 
ming staff. Setting realistic milestones 
and making sure everyone is working 
toward these milestones is a good start¬ 
ing place. But it takes continual efforts 
and personalized attention to make 
sure your staff is motivated to work 
hard and meet demanding milestones. 

The information presented in this 
chapter can get you thinking about how 
best to motivate your staff, but in the 
end you must think about this over and 
over and continually work to make sure 
your staff is motivated every day, week, 
month, quarter, and year. The age-old 
adage of getting “all the oars in the 
water at the same time” is a good one to 
help you visualize how your motivated 
technical staff can and should work 
together as one. I 

□ Find this story at http://sdt.bz/37334 
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In-memory data grids 
set for a big 2013 

Companies prepare for extreme growth 
thanks to analytic needs 


R AM is the hip place to be. For 
modern applications built to 
scale out to thousands or even 
millions of users, scaling the data 
behind those applications has long been 
a difficult task. But a host of in-memory 
data grids from companies like McOb- 
ject, Oracle and Terracotta are solving 
the scalable data-store problem, and 
they all expect 2013 to be a banner year 
for such software. 

Massimo Pezzini, vice president and 
fellow at Gartner Research, said he 
expects explosive growth in the in¬ 
memory data store market in 2013. 
While the research he cited is not yet 
published, he did have figures on the 
market and its potential for growth. 

“We think in 2011 the market for in¬ 
memory database was approximately 
US$250 million in terms of license and 
maintenance revenue... We’ve spoken 
with vendors, and some are projecting 
high double-digit, if not triple-digit 
growth in 2013. Terracotta is expecting 


BY ALEX HANDY 

to triple its revenues this year. We think 
this is going to be a $1 billion market by 
2016. In software, $1 billion is a big 
market,” he said. 

There are many reasons for this 
growth, but Pezzini said a few use cases 
are most common. “I would say the most 
obvious use case is really caching: 
caching a database, caching a session in a 
website, etc.,” he said. “Quite a lot of cus¬ 
tomers have started using an in-memory 
data grid in that way: their own layer to 
speed up the performance of Web appli¬ 
cations.” But new use cases are cropping 
up thanks to the proliferation of clouds 
and the need to operate at scale. 

“Lately, we have seen examples of 
customers using an in-memory data 
grid as a data-management platform, as 
a platform to host the database of 
record,” said Pezzini. “In business prac¬ 
tice, that is not relational, because in¬ 
memory data grids are based on an 


object-oriented NoSQL paradigm. This 
is one of the reasons customers are 
looking into in-memory data grids.” 

Mike Allen, vice president of prod¬ 
uct management at Terracotta, thinks 
there are a few reasons behind the 
growth of in-memory data grids, but 
there’s one large factor he cited. “One is 
data volume, and now you can suddenly 
get machines with a lot of memory very 
cheaply,” he said. “You can stack up six 
servers with a half-terabyte of RAM 
each, and then keep all your data in 
memory, which was never really possi¬ 
ble before. We scale that grid pre¬ 
dictably and scale it to that capacity.” 

Another reason for the growth of in¬ 
memory data grids, said Allen, is the 
new focus on analytics in business. With 
Terracotta, or any other in-memory 
data grid, analytics can be run in real 
time because all the information is 
stored in RAM. 

“If I’m doing transactions on an e- 
commerce site, I don’t typically have a 
view into those transactions until after,” 
he said. “But I can now look at that data 
in-flight, and do real-time promotions or 
modify pricing. I can offer people incen¬ 
tives, or correlate current actual behavior 
with profile info about historical access.” 

Uri Cohen, vice president of prod¬ 
uct management at GigaSpaces, said 
there are two major reasons for the 
growth of in-memory data grids. “The 
first is that the market for such solu¬ 
tions has definitely grown, with drivers 
such as the explosion of user-generated 
data and Web-scale deployments,” he 
said. “Whereas in the past most of the 
demand for these technologies came 
from high-end financial services and 
continued on page 62 ► 
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telecom applications, today its preva¬ 
lent in almost every vertical. 

“We’re seeing this demand in e-com¬ 
merce, travel, fraud detection, homeland 
security, and SaaS implementations, to 
name a few. Some apps need to process 
tens or even hundreds of thousands of 
events per second, which is only feasible 
if you’re using a distributed architecture. 
The in-memory aspect of things is what 
allows you to do it at real-time latencies, 
meaning you can do this as the events are 
flowing into your system and not have to 
wait for a batch Map/Reduce job to get 
the processed data and insights. 

“The second trend is that with the 
advent of NoSQL data stores and cloud 
technologies, which drive people toward 
distributed architectures, the market is 
much better educated about such tech¬ 
nologies and understands the trade-offs 
associated with them, with terms like 
CAP and BASE being widely known and 
reasonably well-comprehended. This 
saves us a lot of work in explaining our 
technology and how to implement your 
applications on top of it.” 


Analyze that 

Indeed, analytics are a major new draw 
to in-memory data grids, said Pezzini. 
“In the context of big data applications, 
what happens is the customers store data 
in memory to an in-memory data grid. In 
some cases, they are storing terabytes of 
data, and they want to run analytical 
types of applications on top of that. 
[That means supporting] query lan¬ 
guages and Map/Reduce APIs. I believe 
this will be the next battleground.” 

Craig Blitz, senior group product 
manager of Oracle Coherence, the in¬ 
memory data grid cited as the most pop¬ 
ular by Pezzini, said that Coherence is 
seeing increasing use in analytics, as well. 

“Data grids play a key role in handling 
these new demands,” said Blitz. “Their 
distributed caching capabilities offload 
shared services by reducing the new 
number of repeated reads across all 
application instances and, in the case of 
Oracle Coherence, by batching and coa¬ 
lescing writes. Application objects are 
stored in-memory in the application tier 
or a separate data grid tier (or both, using 
Oracle Coherence’s near-caching capa¬ 


bilities). Oracle Coherence provides a 
rich set of query, Map/Reduce aggrega¬ 
tion and eventing capabilities to provide 
for a scalable compute platform as well. 
Companies are using data grids to drive 
real-time event-based calculations as 
application objects are updated, then to 
provide fast access to those calculations.” 

Making money 

Ted Kenney, director of marketing at 
McObject, has taken a more vertical 
approach to in-memory data grids. 
McObject now offers the eXtremeDB 
Financial Edition specifically targeted at 
stock-trading platforms and other high- 
volume, high-speed trading systems. 

“Data in the financial space is fairly 
specialized market data—with things like 
trades and quotes—and a lot of it is time- 
series data,” said Kenney. “It’s the same 
data point with different values that 
change at regular intervals over time. 
That requires a somewhat specialized 
approach to data management. Column- 
oriented databases are very good at that 
for fairly technical reasons. It’s much 
more efficient to manage it in a column 
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Using McObject's eXtremeDB Financial Edition, transactions and trade data can be stored in a 
column-based grid rather than in a row-based data store. 


database than in a row. That is the heart 
of the new features we’ve added in 
eXtremeDB Financial Edition.” 

And that shows one of the key differ¬ 
entiators between in-memory data grids 
and other Big Data solutions, said Ken¬ 
ney: In-memory data grids don’t have to 
be huge. An Apache Hadoop cluster is 
almost worthless unless it’s hosting at 
least a dozen terabytes of data; analyzing 
such data would be faster on a local sys¬ 
tem than a cluster designed for 
petabytes of data. Thus, in-memory data 
grids, like eXtremeDB Financial Edi¬ 
tion, can still yield performance benefits 
when hosting smaller sets of data (in the 
terabyte range and below). 

“In the financial space, it doesn’t feel 
particularly crowded because of the 
unique needs of that space,” said Ken¬ 
ney. “The new entrants are large-scale 
big software applications. We find one 
of the most appealing things to devel¬ 
opers in the financial space is some¬ 
thing that comes from our embedded 
background: It has a short execution 
path, and doesn’t consume as many 
CPU cycles as these NoSQLs will.” 

But analytics remains a major draw 
for new users of in-memory data grids, 
said Jon Webster, vice president of busi¬ 
ness development at GridGain. “We 
take an in-memory data grid and we also 
have a processing data grid. We give you 
a cohesive API. That’s where you get the 
processing plus storage. If you look at 
some of the older caching technologies, 
you’re still dealing with the pattern of 
The data is stored somewhere; even if 
it’s in Coherence, take it out of there and 
put it back into an application server and 
process it somewhere else.’ The real 
problem is that moving that data is 
exceptionally expensive,” he said. 

To that end, GridGain uses that 
cohesive API to allow developers to ana¬ 
lyze data stored in their in-memory data 
grid without having to move it to anoth¬ 
er platform. When dealing with fresh 
terabytes of data every day, that can save 
developers a great deal of time. 

You host it for me 

The popularity of in-memory data grids 
has even spawned a new cloud-based 
service. Ofer Bengal, cofounder and 


CEO of Garantia Data, said that his 
company has taken the popular NoSQL 
data store Redis and turned it into a 
scalable data grid platform hosted with¬ 
in the company’s own cloud service. 

While Redis is an open-source key- 
value store, Garantia Data turns it into 
an automatically scalable in-memory 
data grid for applications that need ter¬ 
abytes of quickly accessible information. 

“When it comes to replicating Redis 
onto disk, or into persistent storage, 
you’ll normally see performance degra¬ 
dation,” said Bengal. “We have over¬ 
come this problem, and we offer real¬ 
time applications to persistence storage 
without any degradation of perform¬ 
ance. We also offer instant fail-over.” 

And, at the end of the in-memory 
day, the usefulness always comes back 
to the application being tied into the 


grid. “More and more, we see in-mem- 
ory data grid technology being embed¬ 
ded into other software products,” said 
Gartner’s Pezzini. “Enterprise servers, 
VPN tools, packaged applications. Of 
course this is going to help a lot with 
the establishment of this market.” 

And while the market is growing 
quickly, Gartner does not yet have a mag¬ 
ic quadrant for in-memory data grids. “I 
think TIBCO could be the rising star 
here, and could become an important 
player in this market,” said Pezzini. “Soft¬ 
ware AG with Terracotta will be growing 
primarily because of their software mar¬ 
keting organization. [Software AG] is 
going to leverage Terracotta inside many 
web Methods products, and a bunch of 
new offerings they have there in the 
making around Big Data.” I 

□ Find this story at http://sdt.bz/37316 
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Code Watch 

BY LARRY O'BRIEN 


What would Map do? 


I n my previous column, I discussed the power of 
Map/Reduce, both in terms of its common use 
as an engine for distributed analysis of Big Data 
and also as a fundamental concept, an insight into 
the power of the functional programming concepts 
of Map (which applies a function across a collec¬ 
tion of elements) and Reduce (which summarizes 
or otherwise condenses a collection of elements). 

As IVe also discussed in my column, for two years 
Scala was my day-to-day programming language, and 
for much of that time I waited for the functional pro¬ 
gramming “light bulb” that would help me under¬ 
stand the zeal of functional programmers. 

I’m a stupid person. One of the ways my stupid¬ 
ity manifests is that it’s hard for me to recognize— 
until they’re no longer easily available—the best 
elements of a tool. It was only when I started pro¬ 
gramming in C++ that Smalltalk made sense to 
me. And it’s only been recently, when I returned to 
the land of more mainstream languages, that I can 
sense a functional light bulb flickering on. 

Many discussions of functional programming 
argue that the key is the lack of reassignment to vari¬ 
ables after they’ve been initialized (also known as 
immutability, referential transparency, etc.). And 
that’s certainly beneficial, but preferring immutabili¬ 
ty to state is simply good programming and does not 
guide technical design or architecture. Admittedly, 
functional languages’ emphasis on programming 
without reassignment drives the developer toward 
separating control flow from data manipulation (and 
that’s a benefit), but it’s not transformative. 

But there are two aspects of functional program¬ 
ming that I find are dramatically underused in day- 
to-day object-oriented programming. One is lazy 
evaluation. This is the ability of a functional language 
to pause a calculation, return an intermediate value, 
and then continue the calculation at some time in the 
future. Most modern languages have a “yield” key¬ 
word, and if you combine a yield with a recursive 
function, you get lazy evaluation-like behavior, 
including the ability to define very large—even infi¬ 
nite—sequences, and yet work with them efficiently. 

Once I lost laziness as a day-to-day facility of my 
programming language, I found myself asking 
“Where can I yield?” in my designs. This has 
improved my designs because I can confidently 
delay the bookkeeping aspects of a search or tra¬ 


versal. These aspects can clutter code significantly 
since they often demand that your functions, in 
addition to doing their domain work, maintain a 
knowledge of “where in the calculation” they are. 

The other question that I’ve begun to ask myself 
is iC What would Map do?” This is my shorthand 
reminder to always think about how higher-order 
functions would apply to my classes. “Map” and 
“Reduce” are just two of several commonly written 
higher-order functions that are most familiar from 
collection classes. My jaw would drop if any pro¬ 
grammer told me they didn’t prefer working with 
these types of functions to writing imperative loops. 
But the flickering light bulb in my head is not “Those 
are nice in collection classes,” but “A Map function 
would make me structure this class differently.” 

Specifically, I think that always considering 
higher-order functions naturally guides you toward 
class structures that have two 
benefits: classes naturally prefer 
aggregation to inheritance, and 
individual classes seem to have 
the same level of abstraction in 
their contents. 

For instance, in classic object 
orientation, the sentence “A Car is 
an object that has an Engine and four Wheels and 
some number of Doors” might guide you to a Car 
that has one Engine (a relatively abstract concept), a 
hard-coded number of Wheels (quite concrete), and 
an indeterminately-sized collection of Doors (a con¬ 
cept that may or may not warrant a class to model). 
“What would Map do?” guides the design toward a 
Car object that aggregates traveling statistics, con¬ 
cepts relating to power, and physical components. 
That might not be a good design, but the important 
thing is that it’s a perspective that wouldn’t naturally 
arise from standard OOP processes. 

I don’t always actually write higher-level func¬ 
tions for my classes. Although the question guided 
me in valuable ways, I’m actually not sure about 
using “What would Map do?” for a Car class. I sus¬ 
pect that instead of a single Map function, the 
answer might be several functions with names 
meaningful to the automotive domain (like Map- 
TravelStatistics, MapPower and MapComponents). 
And then one higher-order function to rule them 
all? I don’t know; the light bulb is still flickering. I 



Larry O’Brien is a 
developer evangelist/ 
advocate for Xamarin. 
Read his blog at 
www.knowing.net. 


Wat would Map do?' is my 
reminder to always think 
about how higher-order 
functions apply to classes. 


□ Find this story at 
http://sdt.bz/37309 
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Guest View 

BY ANGUS TELFER 

AMQP takes over middleware messaging 


Angus Telfer is a founder 
and CTO of INETCO 
Systems, and was 
co-chair of the OASIS 
AMQP 1.0 Technical 
Committee at the time 
of its standardization. 


The word is out that 
the way to connect services 
is by open standards. 


□ Find this story at 
http://sdt.bz/37189 


t the end of October, members of the OASIS 
international open standards consortium 
approved version 1.0 of the Advanced Message 
Queuing Protocol (AMQP), an enterprise-level 
messaging protocol, as an OASIS Standard. 

Active support by Red Hat, Microsoft, VMware, 
INETCO and a plethora of other vendors makes 
this a very credible standard. Active support by 
JPMorgan, Bank of America, Deutsche Borse, 
Goldman Sachs, Credit Suisse, the US Depart¬ 
ment of Homeland Security, and other large end 
users guarantees its a standard that will be seen in 
the enterprise at the very least. 

The standardization of AMQP marks a turning 
point in the industry, a turning point from the past, 
consisting of a small number of enterprise-capable 
proprietary systems plus numerous, specific-pur¬ 
pose “roll your own” protocols, to the future, con¬ 
sisting of a standard messaging 
protocol used to interconnect 
products and services from a 
large number of vendors. In fact, 
with a modest degree of luck, 
AMQP will do to middleware 
messaging what TCP did for 
computer-to-computer communications, IEEE 
802 did for local area networking, and HTTP did 
for the Internet. 

The standardization of AMQP signals the death 
knell for proprietary messaging systems. Large end 
users have signaled by their participation in the 
AMQP standards group—and their use of products 
developed with the protocol—that they are tired of 
being held hostage to proprietary messaging sys¬ 
tems. Even if AMQP does not itself end up being 
the ultimate messaging standard, the word is out 
that the way to connect services is by open stan¬ 
dards. Translation: If vendors do not want to use 
AMQP, they still need to be ready to propose an 
alternate open standard. 

What justification is there for this confidence? 
Well, the big push over the past few years has been 
to connect products and services from multiple 
vendors into ever-larger total solutions. In the days 
when distributed systems existed within the enter¬ 
prise itself, the different components could be pro¬ 
vided by a single vendor using a proprietary mes¬ 
saging technology. In the current environment of 


cloud and service-based computing, this is no 
longer the case. Different components come from 
different vendors: some big, some small, some Lin¬ 
ux, some Microsoft, some in North America, some 
in Europe, some written in Java, some in .NET, 
and so on. Connecting all these via proprietary 
technologies is just not an option. 

An excellent way to ascertain when an area is ripe 
for standardization is to look at the external environ¬ 
ment. In the case of messaging protocols, the 
increasing activity in the area of open-source proto¬ 
cols of various types, along with the increased inter¬ 
est and adoption rates by large organizations, has 
made it clear that the time has come for a standard. 
Likewise, when vendors and organizations start 
working together, it is inevitable that the end result 
will be a standard that will revolutionize the space. 

So why should AMQP be that standard? It is 
attractive as a general-purpose messaging protocol 
for a number of reasons: 

• It supports a comprehensive set of interaction 
patterns. 

• It is independent of any broker technology, 
allowing it to be used by a diverse set of brokers. 

• It stands on its own, allowing AMQP to be used in 
real-time, point-to-point messaging environments. 

• There are interoperable products that use it. 

• Non-key capabilities are optional, allowing it to 
be very wire-efficient for real-time applications 
requiring minimal messaging capabilities, while 
still supporting more intensive broker-based 
transactional applications. 

• The timing of the AMQP standardization effort is 
dead on to when the market needs it. 

• It enjoys support from both large and small ven¬ 
dors as well as large, game-changing organizations. 

• There are currently no competing standards that 
offer similar capabilities, enjoy similar support, 
and are application/broker independent. 

Full disclosure: I wouldn’t be using AMQP or par¬ 
ticipating in its standardization if I did not believe in 
it. However, I am a co-chair of the OASIS AMQP 1.0 
Technical Committee, and the CTO of a company 
that uses AMQP in its main product. 

One word of caution: AMQP 1.0 has evolved 
from a number of earlier attempts at messaging. 
Anything that is not specifically AMQP 1.0 should 
be ignored. They’re different protocols. I 
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Analyst View 


BY ROB ENDERLE 


The Fall of Apple in Three Acts 




A pple is clearly trending downward, cutting 
back on supplies as I write this, and its valu¬ 
ation is dropping sharply week after week. At CES 
this year, the typical shadow Apple casts over the 
show wasn’t in evidence, and Steve Wozniak, one 
of the living Apple founders, has been lamenting 
that even Microsoft is out-innovating the company 
While no doubt there will be hills and valleys to 
this decline, it has and will occur in three discern- 
able steps if it repeats its prior fall. 

Act I: The Departure/Death of the King 

This event has clearly passed but, as with the last 
time Jobs left the company, Apple appears increas¬ 
ingly rudderless and unable to make critical deci¬ 
sions or take important risks. Jobs, during his last 
term in the company, integrated himself even 
more deeply. In the 1980s, he drove marketing and 
the Mac; over the last decade, he drove everything 
and seemed to be a critical part of every major 
decision, from how a product was conceived, to the 
final design, to how it was marketed and sold. 

He surrounded himself with experts in areas he 
didn’t know (like retail) and in skills he didn’t have 
or want to develop (like logistics). The result was a 
cohesive team with Jobs as the central hub. When 
Jobs left, Apple lost this hub, and over the follow¬ 
ing months, person by person, the key parts of the 
engine he was central to departed, were fired, or 
were left in their jobs without the guidance from 
Jobs they depended on. Yes, many of the early 
decisions had been made by Jobs before his depar¬ 
ture, but most are done now, and some, like the 
iPad Mini, were overturned. 


providing premium products, which will eventually 
force it to become ever more aggressive in pricing 
the products it does have, thereby collapsing its mar¬ 
gins. Instead of standing alone, Apple will become 
more equivalent to Dell, Lenovo, Acer, Asus and HP. 

Act III: The Desperation Death Spiral 

As the company drops in value, the stock and stock 
options that form the basis for much of the com¬ 
pensation for board members and top executives 
fall, eliminating much of the personal income and 
wealth of these powerful people. This tends to 
make them increasingly desperate, with the result 
being from-the-hip decisions regarding new prod¬ 
ucts or services that will increasingly find a hostile 
market because they weren’t well thought-out. 

Desperation leads to mistakes, one of them 
being the rapid cycling of top executives, either 
because they leave for firms with 
better prospects, or because they 
become the scapegoat for the 
current month’s problems. This 
typically includes the CEO, who 
may find his or her time in the 
firm cut short. Thus begins a 
parade of CEOs, each a little 
worse than the previous, as the pool of smart, tal¬ 
ented, experienced CEOs looks at Apple less as an 
opportunity for fame and fortune and more as a 
place that ends CEO careers. 

The last time Apple got into this phase, Steve 
Jobs came back personally and saved it. There is no 
Steve Jobs to come back this time. 


Rob Enderle is a 
principal analyst at the 
Enderle Group. 


The last time Apple got into 
this phase, Steve Jobs came 
back and saved it. There is 
no Steve Jobs this time. 


Act II: The Death of a Brand 

Today, Apple releases new products to Walmart to be 
sold at sharp discount, or to be given away—with a 
phone contract—for free. Jobs designed Apple to be 
a premium brand like Porsche; the automaker was 
one of the design examples he used. As with all pre¬ 
mium brands, the perception of the brand is para¬ 
mount and never compromised with a value product. 
If the firm wants to do a value product, it does so 
under a brand not connected to the premium parent. 

Apple instead is increasingly bringing out value 
products under the Apple brand. The result is that 
people are slowly shifting away from seeing Apple as 


Epilogue: Saving Apple 

This trend can be stopped at any time, not by replac¬ 
ing individual members, but by redesigning the 
executive team so that it can function without Jobs. 

However, while I’ve seen a lot of companies— 
including the old Apple—drop into this pattern, I’ve 
never seen any pull out of it. That doesn’t mean it 
can’t be done, only that it is very unlikely. 

I think we are beginning to see the end of Apple, 
a catastrophe that won’t be avoided unless it’s board 
addresses it by retooling the executive team and the 
company around it. While this could be done in 
time, history suggests it won’t, and that will mean a □ Find this story at 
sad ending for this once-great company. I http://sdt.bz/37311 
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Industry Watch 


BY DAVID RUBINSTEIN 


Time to rethink the back end for mobile 


David Rubinstein is 
editor-in-chief of SD Times. 


The 'mobile multiplier'is 
changing the traditional 
three-tier infrastructure that 
has defined applications. 


□ Find this story at 
http://sdt.bz/37324 


S itting at a desktop, the average bank customer 
checks his or her accounts three or four times a 
month. A mobile customer, while waiting for a train, 
riding the train, sitting at a cafe or even waiting to 
pick up a child from soccer practice, will check his or 
her bank account 15 to 20 times per month. 

This, according to Archie Roboostoff, results in 
more than a quadrupling of traffic to the site, stress¬ 
ing the infrastructure. Roboostoff, the Rorland solu¬ 
tions portfolio director at Micro Focus, made the 
point to illustrate that mobile devices—and mobile 
access to websites and applications—are changing 
the expectations of back-end software. 

Its this “mobile multiplier” 
that Roboostoff believes is driving 
a change in the traditional three- 
tier infrastructure that defined 
applications almost since the 
inception of the Web. “The three- 
tier infrastructure—Web hed, app 
server, database—was great. You’d 
initiate a request and get a response... We saw a 
number of customers try to leverage that same infra¬ 
structure for when they were putting out their 
mobile app. They tried to make it work with Web 
services that were wrapped, but one of the dynamics 
changing the need to do that is the notion of what we 
call the mobile multiplier. 

“Let’s say, for simplicity, you’ve got 1,000 end 
users hitting your website on a daily basis,” he con¬ 
tinued. “Now those end users typically have more 
than one device, and they’ll start interacting with 
your back end a little bit more often, especially if 
you’re getting into a customer intimacy application, 
where you’re wanting to push data more often.” 

Roboostoff pointed to such things as node.js, 
which creates smaller, localized servers, and Kaaz- 


ing’s WebSocket Gateway, which is designed to fire 
off rapid requests over and over again—“almost 
like mini HTTP servers”—as ways to address these 
issues. 

“We’re seeing customers setting up dedicated 
infrastructure for mobile traffic,” said Roboostoff. 
“Microsoft and IRM will make the necessary adjust¬ 
ments in their technologies eventually” to accom¬ 
modate mobile applications, he said, and he’s seeing 
companies like AppFog, which offers a Platform-as- 
a-Service, making a number of acquisitions for high¬ 
er-speed, higher-scale node.js deployments. 

For enterprise organizations, this means mak¬ 
ing sure their transaction gateways are strong 
enough to satisfy such things as in-app purchases; 
when players buy 100 coins in the middle of a 
game, you need to verify the coins are delivered 
without hanging up the game, he said. 

Further, these organizations must be able to test 
their applications across every platform, not only 
functional and load testing, but also doing network 
simulation to ensure the application will perform 
regardless of where the request is coming from. 

“We’re seeing a lot of customers not only rethink 
how they’re doing their infrastructure planning, but 
also their pre-production load testing,” he said. “Tra¬ 
ditionally, you put out a Web server, send a sustained 
user load, see what happens, and fine. But customers 
are asking us to...simulate network-loading patterns. 
They want 100 users to come in at 9 a.m. on a 4G 
network, followed by everyone on a Tl, then after 
lunch have users coming in on a BlackBerry, to see 
what’s happening and why. 

“Don’t just think you can throw a mobile app 
out there, hope for the best, and hope that the 
existing infrastructure works,” he said. “You’re 
going to need to think it through.” I 


Events Calendar 


DATE 

SHOW 

CITY 

SPONSOR 

LINK 

Feb. 15-16 

Snow*Mobile 2013 

Madison, Wise. 

Sapling Events 

snow-mobile.org 

Feb. 25-March 1 

RSA Conference 

San Francisco 

RSA 

www.rsaconference.com 

Feb. 26-28 

ApacheCon 

Portland, Ore. 

Apache Software Foundation 

na.apachecon.com 

Feb. 26-28 

Strata Conference 

Santa Clara 

O'Reilly Media 

strataconf.com 

March 3-6 

SPTechCon San Francisco 

San Francisco 

BZ Media 

www.sptechcon.com 

March 3-6 

Pulse2013 

Las Vegas 

IBM 

www.ibm.com/pulse 

March 13-21 

PyCon Python Conference 

Santa Clara 

Python Software Foundation 

us.pycon.org/2013 


For a more complete calendar of U.S. software development events, see www.sdtimes.com/calendar. Information is subject to change. Send news about upcoming events to events@bzmedia.com. 
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Big Data gets real 
at Big Data TechCon! 


Discover how to master Big Data from real-world practitioners - instructors who 
work in the trenches and can teach you from real-world experience! 



Class Catalog 

Come to Big Data TechCon to learn 
the best ways to: 

• Collect, sort and store massive quantities of 
structured and unstructured data 



• Process real-time data pouring into your organization 

• Master Big Data tools and technologies like Hadoop, 
Map/Reduce, NoSQL databases, and more 

• Learn HOW TO integrate data-collection 
technologies with analysis and business-analysis 
tools to produce the kind of workable information 
and reports your organization needs! 

• Understand HOW TO leverage Big Data to help 
your organization today 


BigData 

— TECHCON 


Register Early and SAVE! see last page for details! 


April 8-10,2013 
Boston, MA 

www.BigDataTechCon.com 
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Chairman’s Letter 


BigData 

— TECHCON 



Dear colleague, 

The amount of data our organizations collect is growing expo¬ 
nentially, and the more data we have, the greater the opportuni¬ 
ties we have to use that data information. Big Data can improve 
efficiency, reduce waste, empower employees and customers, in¬ 
crease competitiveness, and benefit the bottom line. 

The tangible benefits of Big Data analytics are 
well known. You can read about them in the IT 
press, and also in business journals and the daily 
newspaper. Many books have been published 
about the “why” of Big Data. Conferences de¬ 
voted to exploring the trends are happening 
everywhere. 

We understand the “why” of Big Data, the 
practical reasons for storing, searching, sharing, 
analyzing and reporting through these gigantic data sets. 

Big Data TechCon isn’t a “why” conference. It’s the HOW-TO 
conference for Big Data. Practical workshops. Technical classes. 
Thorough examinations of the real-world choices in storage, pro¬ 
cessing, analysis and reporting of Big Data information. Strategies 
for rolling out Big Data projects in your organization. 

Come to Big Data TechCon to learn HOWTO accommodate 
the terabytes and petabytes of data from your Web logs, social 
media interactions, scientific research, transactions, sensors and 
financial records. Learn how to index, search and summarize the 
Big Data. Learn how to empower employees, inform managers, 
reach out to customers. 


Big Data TechCon is technology-agnostic. The workshops and 
classes apply to Big Data in your data center or in the cloud, from 
hosted environments to your own servers. The sessions apply to 
relational databases, NoSQL databases, unstructured data, flat 
files and data feeds. 

The faculty have real-world experience that you can tap into, 
whether you use Java, C++, .NET or JavaScript; whether you like 
MySQL, SQL Server, DB2 or Oracle; whether you love or hate 
Hadoop, HBase, Cassandra or Pig; and whether you are looking at 
dozens of terabytes or hundreds of petabytes. 

Produced by BZ Media—publisher of SD Times, the leading 
magazine for software development managers—Big Data Tech¬ 
Con is the biggest, most info-packed, most practical HOW-TO Big 
Data conference in the world. No hype. All tech, all the time. 

During the three-day conference, choose from 40+ workshops 
and technical classes at all levels, from overview to intermediate 
to advanced. Of course, the technical classes and workshops are 
only part of the benefit of attending Big Data TechCon. 

Learn from the smartest, hardest-working faculty in the Big 
Data universe in a way you never could by reading a book or 
watching a webinar. Mingle with fellow attendees. Talk shop dur¬ 
ing meals and receptions. Be inspired by keynotes, be informed 
by general sessions, be impressed by the hottest Big Data tools in 
the Expo Hall. It’s all waiting for you. 

See you in Boston! 

Alan Zeichick 

Conference Chairman 



The H0W-T0 conference for Big Data 
and IT Professionals! 

• Learn tips, tricks and techniques that will make you 
your company’s Big Data Expert! 

• Discover how to master Big Data from real-world 
practitioners—instructors who work in the trenches 
and can teach you from real-world experience 
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Event 

Schedule 

Sunday, April 7 

4:00 pm-7:00 pm 

Registration Open 

Monday, April 8 

7:30 am-7:00 pm 

Registration Open 

7:30 am-8:30 am 

Morning Coffee 

8:30 am - 10:00 am 

Workshops 

10:00 am - 10:15 am 

Coffee Break 

10:15 am- 12:15 pm 

Workshops 

12:15 pm- 1:15 pm 

Lunch Break 

1:15 pm-3:00 pm 

Workshops 

3:00 pm-3:15 pm 

Coffee Break 

3:15 pm-5:00 pm 

Workshops 

5:15 pm-6:30 pm 

Lightning Talks 

Tuesday, April 9 

7:30 am-7:00 pm 

Registration Open 

7:30 am-8:30 am 

Morning Coffee 

8:30 am-9:30 am 

Technical Classes 

9:30 am-9:45 am 

Coffee Break 

9:45 am - 10:45 am 

Keynote - Dr. Michael Stonebraker 

11:00 am - 12:00 pm 

Technical Classes 

12:00 pm-7:00 pm 

Exhibit Hall Open 

12:15 pm - 12:45 pm 

Sponsored Classes 

12:45 pm - 1:45 pm 

Lunch Break 

12:45 pm - 1:45 pm 

Women in Big Data Luncheon 

1:45 pm-2:45 pm 

Technical Classes 

2:45 pm-3:15 pm 

Coffee, Ice Cream in Exhibit Hall 

3:15 pm-4:15 pm 

Technical Classes 

4:30 pm-5:30 pm 

Keynote - Oscar Boykin, Ph.D. 

5:30 pm-7:00 pm 

Networking Reception in Exhibit Hall 

7:15 pm-8:45 pm 

Fireside Chats 


Wednesday, April 10 


7:30 am-4:00 pm 

Registration Open 

7:30 am-8:45 am 

Morning Coffee 

8:45 am-9:45 am 

Technical Classes 

10:00 am - 11:00 am 

Keynote - Adam Laiacano 

11:00 am-3:30 pm 

Exhibit Hall Open 

11:00 am - 11:30 am 

Coffee Break in Exhibit Hall 

11:30 am - 12:30 pm 

Technical Classes 

12:45 pm - 1:45 pm 

Lunch Break 

1:45 pm-2:45 pm 

Technical Classes 

2:45 pm-3:15 pm 

Coffee Break & Prizes in Exhibit Hall 

3:30 pm-4:30 pm 

Technical Classes 

4:30 pm 

Conference Closes 


BigData 
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Keynotes 


Tuesday, April 9 


9:45 am -10:45 am 


Dr. Michael Stonebraker 

Database Pioneer and Adjunct Professor at MIT 


Big Data Is (at Least) Four Different Problems 

Our ability to mine and analyze “Big Data” has 
I yet to catch up to our ability to generate and 
collect it. Moreover “Big Data” and “Big Analytics” 
mean different things to different people. This 
keynote from one of Big Data's rock stars unpacks the Big Data 
problem into four distinct domains: Big volumes of data with 
SQL analytics, Big volumes of data with complex analytics, 

Big velocity, Big diversity. 


4:30 pm-5:30 pm 



Oscar Boykin, Ph.D. 

Staff Software Engineer at Twitter 
and co-author of Scalding 

Addition in the Large: Simple Counts 
and Not-So-Simple Counts 

Details coming soon! 



Wednesday, April 10 

10:00 am -11:00 am 



Adam Laiacano 

Data Scientist and Engineer 
at Tumblr 


ftTTMflfa 


m 


Improve, Don’t Abuse: Use Big Data for Your 
Customers, not Against Them 

Social networks are able to collect large amounts 
of activity data from their user and customer 
base. As Big Data professionals, we conduct experiments on 
custom data sets to measure the effectiveness of our products or 
advertising methodologies. Since a social network is effectively 
useless without an active community, our companies owe it to 
their users to create new and better products based on this 
information. Learn how our data analysis and predictive analytics 
must take a different approach than Big Data in fields like 
finance, medicine, and defense. 
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Special Events 


Monday, April 8 

5:15 pm -6:30 pm 

Lightning Talks 

Learn something new in a handful of short, targeted talks, 
PLUS names will be drawn for free giveaways. 


Tuesday, April 9 



9:45 am -10:45 am 

Keynote 

Dr. Michael Stonebraker 

Database Pioneer and Adjunct Professor at MIT 

Big Data Is (at Least) Four Different Problems 



12:00 pm-7:00 pm 

Exhibit Hall Open 

See how the Big Data ecosystem is 
growing and evolving by speaking with 
technical experts in our Exhibit Hall. 


12:45 pm-1:45 pm 

Women in Big Data Luncheon 

Hosted by Lisa Vaas, freelancer for technology publications, 
including CIO Magazine, ComputerWorld and PC Magazine 

Lisa Vaas is pleased to host a 
luncheon for all women 
attending Big Data TechCon. 

What are your challenges in the 
field of Big Data? What are your 
proudest triumphs? Please, 
bring stories to share as we 

explore all aspects of being a woman in this exciting field. 
Come prepared to connect, network, socialize, tell tales, 
and listen to your peers from across the Big Data world. 





2:45 pm -3:15 pm 

Coffee, Ice Cream 
in the Exhibit Hall 
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Tuesday, April 9 (continued) 

4:30 pm -5:30 pm 

Keynote 

Oscar Boykin, Ph.D. 

Staff Software Engineer at Twitter and co-author of Scalding 

Addition in the Large: Simple Counts and 
Not-So-Simple Counts 

5:30 pm -7:00 pm 

Networking Reception 
in the Exhibit Hall 

7:15 pm-8:45 pm 

Fireside Chats 




Wednesday, April 10 



10:00 am-11:00 am 

Keynote 
Adam Laiacano 

Data Scientist and Engineer at Tumblr 

Improve, Don’t Abuse: Use Big Data for 
Your Customers, not Against Them 



11:00am-3:30 pm 

Exhibit Hall Open 

Come explore the latest in 
Big Data developer resources 
in our Exhibit Hall. 


11:00 am -11:30 am 

Coffee Break in Exhibit Hall 

2:45 pm -3:15 pm 

Winner’s Circle prizes 
announced in 
the Exhibit Hall 
























5 * April 8-10,2013 * Boston, MA * www.BigDataTechCon.com 


Conference Planner : 


TECHNICAL LEVELS 

Overview 
Intermediate 
Advanced 
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Monday, April 8 Click or touch class title to read description. 

TIME 

TITLE 

INSTRUCTOR 

TECHNICAL 

LEVEL 

7:30 am -7:00 pm 

Registration Open 



8:30 am-12:15 pm 

MORNING WORKSHOPS (10:00 am -10:15 am Coffee Break) 




Crash Course in Machine Learning 

Dean Wampler 

■ 


Getting Started with Cassandra 

Ben Coverston 

■ 


Hands-on NoSQL for the DBA 

Lynn Langit 

■ 


Introduction and Best Practices for Storing and Analyzing Your Data 
with Adobe Hive 

Mark Grover 

■ 


Introduction to Hadoop, Map/Reduce and HDFS for Big Data Applications 

Serge Blazhievsky 

■ 

12:15 pm-1:15 pm 

Lunch Break 



1:15 pm -5:00 pm 

AFTERNOON WORKSHOPS (3:00 pm - 3:15 pm Coffee Break) 




Hadoop Data Warehousing with Hive 

Dean Wampler 

■ 


A Case Study: Data Visualizations and Insight into 

How America Eats at Restaurants 

David Hwang 

■ 


Hands-on Google BigQuery and Google Predictive API 

Lynn Langit 

■ 


Hive, Pig, Cascading and Codd: A Crash Course in Map/Reduce 

Relational Languages via an Appeal to History 

Daniel Eklund 

■ 


Programming with Scalding and Algebird 

Krishnan Raman 

■ 

5:15 pm -6:30 pm 

Lightning Talks 
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■ Overview 

■ Intermediate 
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Tuesday, April 9 Click or touch class title to read description. 


TIME 

TITLE 

INSTRUCTOR TECHNICAL 

LEVEL 

7:30 am -7:00 pm 

Registration Open 



8:30 am-9:30 am 

Visualizing Your Graph 

Max De Marzi 

■ 


Building an Impenetrable ZooKeeper 

Kathleen Ting 

■ 


Apache Cassandra—A Deep Dive 

Ben Coverston 

■ 


Taming Elephants, Bees and Pigs - The Big Data Circus 

Asish Thusoo 

■ 


Distributed Search and Real-Time Analytics, Part 1 

Jason Rutherglen and Ryan Tabora 

■ 


How to Integrate Structured and Unstructured Data with Avro 

Serge Blazhievsky 

■ 


Data Modeling and Relational Analysis in a NoSQL World 

Michael Miller 

■ 

9:45 am -10:45 am 

Keynote by Dr. Michael Stonebraker 



11:00 am-12:00 pm 

Beyond Map/Reduce 

Dean Wampler 

■ 


Agile Analytics Applications with Hadoop 

Russell Jurney 

■ 


Hands-on MongoDB 

Lynn Langit 

■ 


Analytics Maturity Model 

John A. De Goes 

■ 


Distributed Search and Real-Time Analytics, Part II 

Jason Rutherglen and Ryan Tabora 

■ 


Big Data Science: Extracting Truth from Large, 

Multi-Structured Data, Part 1 

Dan Mallinger 

■ 


Getting Started with R and Hadoop, Part 1 

Jeffrey Breen 

■ 

12:00 pm -7:00 pm 

Exhibit Hall Open 



12:15 pm-12:45 pm 

Sponsored Classes 



12:45 pm-1:45 pm 

Lunch Break, Women in Big Data Luncheon 



1:45 pm -2:45 pm 

Untangling the Relationship Hairball with a Graph Database 

Max De Marzi 

■ 


Mastering Sqoop for Data Transfer for Big Data 

Jaroslav Cecho and Kathleen Ting 

■ 


Hadoop Programming with Scalding 

Dean Wampler 

■ 


Real-Time Hadoop 

Boris Lublinsky and Michael Segel 

■ 


Getting Started with Predictive Modeling: Simple Models 
and Basic Evaluation 

Claudia Perlich 

■ 


Big Data Science: Extracting Truth from Large, 

Multi-Structured Data, Part II 

Dan Mallinger 

■ 


Getting Started with R and Hadoop, Part II 

Jeffrey Breen 

■ 

2:45 pm -3:15 pm 

Coffee, Ice Cream in Exhibit Hall 



3:15 pm -4:15 pm 

How to See and Understand Big Data 

Jock Mackinlay 

■ 


Seven Deadly Hadoop Misconfigurations 

Kathleen Ting 

■ 


Hands-on Hadoop for Developers—Understanding Map/Reduce 

Lynn Langit 

■ 


The Dark Sides of Predictive Modeling: Tricks and Pitfalls 

Claudia Perlich 

■ 


NoSQL for SQL Professionals 

Dipti Borkar 

■ 


Hadoop by Example 

Serge Blazhievsky 

■ 


In-Database Predictive Analytics 

John A. De Goes 

■ 

4:30 pm -5:30 pm 

Keynote by Oscar Boykin, Ph.D. 



5:30 pm -7:00 pm 

Networking Reception in Exhibit Hall 



7:15 pm -8:45 pm 

Fireside Chats 
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Wednesday, April 10 Click or touch class title to read description. 

TIME 

TITLE 

INSTRUCTOR 

TECHNICAL 

LEVEL 

7:30 am -4:00 pm 

Registration Open 



8:45 am - 9:45 am 

Building Successful Data Science Teams 

Dan Mallinger 

■ 


Building Applications Using HBase 

Amandeep Khurana 

■ 


Oozie: A Workflow Scheduler for Hadoop 

Boris Lublinsky 

■ 


Extending Your Data Infrastructure with Hadoop 

Jonathan Seidman 

■ 


Matrix Methods with Hadoop 

David Gleich 

■ 

10:00 am -11:00 am 

Keynote by Adam Laiacano 



11:00 am -3:30 pm 

Exhibit Hall Open 



11:00 am -11:30 am 

Coffee Break in Exhibit Hall 



11:30 am-12:30 pm 

Microsoft’s Big Data Story 

Lynn Langit 

■ 


Setting up a Neo4j Graph Database Cluster on Amazon EC2 

Max De Marzi 

■ 


HBase Schema and Table Design Principles 

Amandeep Khurana 

■ 


How to Fit a Petabyte in Apache HBase 

Jean-Daniel Cryans 

■ 


Hadoop Backup and Disaster Recovery 

Jairam Ranganathan 

■ 


What You Can Do with a Tall-and-Skinny QR Factorization on Hadoop 

David Gleich 

■ 


High-Speed Data Ingestion with Sharded NewSQL Databases 

Ryan Betts 

■ 

12:45 pm-1:45 pm 

Lunch Break 



1:45 pm -2:45 pm 

Implementing a Simple Mongo Application 

Andrew C. Oliver 

■ 


Implementing a Real-Time Data Platform Using HBase 

Ravi Veeramachaneni 

■ 


Map/Reduce Tips and Tricks 

Boris Lublinsky 

■ 


Using Apache HBase’s API for Complex Real-Time Queries 

Jean-Daniel Cryans 

■ 


Selecting the Right Big Data Tool for the Right Job, and Making It Work 

Eddie Satterly 

■ 

2:45 pm -3:15 pm 

Coffee Break & Prizes in Exhibit Hall 



3:30 pm -4:30 pm 

A Survey of Probabilistic Data Structures 

Jim Duey 

■ 


Running Mission-Critical Applications on Hadoop 

Dave Jespersen 

■ 


Visualizing Big Data in the Browser 

Simon Metson 

■ 


Building High-Volume Web Applications 

Stefano Santoro 

■ 


4:30 pm Conference Closes 
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Morning Workshops 

8:30 am -12:15 pm 

Crash Course in Machine Learning EZS3 

Dean Wampler 

Machine learning is a broad topic with many categories of 
problems and approaches, as well as lots of special-purpose tricks 
of the trade. We can’t cover all that, of course, but we can give you 
a taste of the most common problems addressed by machine 
learning and some of the techniques used to address them. 

We’ll discuss supervised learning, where a system is trained 
with data that has already been classified, then use that system 
to classify new data. We’ll use the particular example of classifying 
spam vs. non-spam e-mails. Classification applies a set 
of labels to data. We’ll experiment with a SPAM classifier. Finally, 
we’ll finish this section with a brief discussion of regression, 
where a value for a continuous number is assigned to data, such 
as predicting housing prices from historical data. 

How does Netflix determine what movies to recommend to 
you? How does Amazon know what products you might want? 
We’ll examine recommendation engines that compare either user 
preferences or item features to make recommendations for you. 
We’ll experiment with a movie-recommendation engine. 

Clustering is an example of unsupervised learning, where we 
don’t train the system in advance. Instead, the system finds 
structure in the data “on its own.” We’ll look at two examples: 
k-means clustering, where you find k clusters and their centers in 
a dataset; and nearest-neighbors, where you find the nearest 
neighbors to a given data point in an efficient way. We’ll experi¬ 
ment with k-means. 

We’ll finish with a brief discussion of topics you might pursue 
on your own; the importance of data preparation, probabilities 
and statistics; and more advanced machine-learning concepts, 
such as probabilistic graphical models and neural networks. 

Side notes: This workshop is good for intermediate to ad¬ 
vanced developers and data analysts. Some programming ability 
will be assumed, such as using a text editor, writing simple scripts 
in some language, and using simple Linux “shell” commands. 
Level: Advanced 

Getting Started with Cassandra MS 

Ben Coverston 

Unless you have experience with Google BigTable, HBase or 
Cassandra, column-oriented databases are probably an enigma. 
Cassandra’s data model is both simple and powerful. 

It takes some time to get used to the differences between the 
relational model and Cassandra’s column-based model. 

Cassandra is not schema-less, but we do not model relation¬ 
ships in Cassandra either. Data Modeling in Cassandra usually 
consists of finding the best way to denormalize the data when 
you put the data in the database so that you can retrieve it quickly 


LMJ This icon indicates code will be shown in the session. 

and efficiently. This workshop will prepare you for success when 
modeling your data. This tutorial will dive into Cassandra from a 
developer perspective and give you the tools you need to get 
started with Cassandra today. 

This workshop will cover: 

• An introduction to Cassandra in the context of relational 
databases and non-relational alternatives 

• Best practices for modeling your data in Cassandra 

• Cassandra Query Language (CQL version 3) 

• Wide and composite columns 

• Practical examples 

• Anti-patterns (things to avoid) 

For a more advanced look at Cassandra, attend the “Apache 
Cassandra — A Deep Dive” class. 

Level: Overview 

Hands-on NoSQL for the DBA MS 

Lynn Langit 

In this workshop, you will learn the what, why, how, when and 
where around non-relational (NoSQL) database technologies. 
Here you will understand the different types of NoSQL datastor¬ 
age options that are in popular use now. These include graph 
databases such as Neo4j and Freebase, key-value stores like Dy- 
namoDB and Cassandra, document data stores such as Mon- 
goDB, and column data stores like Hadoop. You will learn about 
and work with both locally hosted versions of these data stores 
and cloud-based versions. Importantly, you’ll understand which 
types of data stores will best suit your business and technical re¬ 
quirements. 

The workshop’s objectives include: 

• Understanding the capabilities (and limitations) of NoSQL 
databases 

• Seeing an example of each type of NoSQL database 

• Understanding basic query technologies for NoSQL data 
stores by working with Hadoop (working with Map/Reduce, 
Pig and Hive) 

• Getting hands-on experience installing, querying and per¬ 
forming basic administration with MongoDB 

Prerequisite knowledge: RDBMS development or administration 
(SQL Server references will be used in this talk for comparison). 

Prerequisite software: The class will be taught in Windows; stu¬ 
dents can use a Mac, but step-by-step instructions will differ 
slightly. 

Level: Intermediate 
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Introduction and Best Practices for Storing and Analyzing 
Your Data with Adobe Hive EES 

Mark Grover 

This workshop on Apache Hive will introduce Hive and the 
best practices for storage and data analysis in it. Hive is an open- 
source data-warehousing system based on top of Apache Hadoop 
which lets you query, mine and analyze the data stored in 
Hadoop clusters using familiar SQL-like queries. 

This workshop will go through a hands-on exercise on how 
users can use Hive queries to perform data analysis. Because not 
all analysis can be expressed using SQL-like queries, the work¬ 
shop will cover how to write, test and use User Defined Functions 
and User Defined Aggregate Functions in Hive. 

This workshop will then go through some of the best practices 
related to partitioning, bucketing and joining various datasets in 
Hive. 

You will also learn how to leverage other technologies in the 
Hadoop ecosystem, such as plugging in Map/Reduce scripts from 
Hadoop directly into their Hive queries, and how to integrate 
HBase with Hive to share data across the two systems. The 
workshop will wrap up with a question-and-answer session. 

Note: This hands-on workshop requires you to bring a laptop. 

It is also recommended to install an SDK beforehand (a link will 
be provided). 

Level: Intermediate 

Introduction to Hadoop, Map/Reduce and HDFS for 
Big Data Applications EES 

Serge Blazhievsky 

This workshop will teach you how to solve Big Data problems 
using Hadoop in a fast, scalable and cost-effective way. It is de¬ 
signed for technical personnel and managers who are evaluating 
and considering using Hadoop to solve data-scalability problems. 

We will start with Hadoop basics and discuss best practices for 
using Hadoop in enterprises dealing with large datasets. We will 
look into the current data problems you are dealing with and po¬ 
tential use cases of using Hadoop in your infrastructure. The pres¬ 
entation covers the Hadoop architecture and its main 
components: Hadoop Distributed File System (HDFS) and 
Map/Reduce. We will present case studies on how other enter¬ 
prises are using Hadoop, and look into what it takes to get 
Hadoop up and running in your environment. 

Two case studies will cover near-real-time data-processing sce¬ 
narios and Hadoop cluster implementations for large clusters 
(2,000 to 4,000 nodes). The near-real-time case study can be used 
as guidance for building the infrastructure of near-real-time ar¬ 
chitecture. All components used in architecture are open source 
under the Apache license, and will provide cost-effective solu¬ 
tions for solving Big Data problems. 

By attending this tutorial, you will: 

• Understand Hadoop’s main components and architecture 

• Be comfortable working with Hadoop Distributed File System 


• Understand Map/Reduce abstraction and how it works 

• Understand components of a Map/Reduce job 

• Know best practices for using Hadoop in the enterprise 
We will demonstrate the real-life code for basic Map/Reduce 

jobs and code for working with HDFS during the workshop. 

Level: Intermediate 

Afternoon Workshops 

1:15 pm-5:00 pm 

Hadoop Data Warehousing with Hive EES 

Dean Wampler 

In this hands-on workshop, you’ll learn how to use Hive for 
Hadoop-based data warehousing. You’ll also learn some tricks of 
the trade and how to handle known issues. 

Well spend most of the workshop using a series of hands-on 
exercises with actual Hive queries, so you can learn by doing. 

Well go over all the main features of Hive’s query language, 
HiveQL, and how Hive works with data in Hadoop. Well also con¬ 
trast Hive with relational and non-relational database options. 

Hive is very flexible about table schemas, file formats, and 
where the files are stored. Well discuss real-world scenarios for 
the different options. Well briefly examine how you can write Java 
user defined functions (UDFs) and other plugins that extend Hive 
for data formats that aren’t supported natively. 

You’ll learn Hive’s place in the Hadoop ecosystem, such as how 
it compares to other available tools. Well discuss data organiza¬ 
tion and configuration topics that ensure best performance and 
ease of use in production environments. 

Side notes: This workshop is suitable for beginnerdata analysts 
and software developers. Bring your laptop pre-installed with a 
suitable secure shell (ssh) client, such as Putty for Windows. 

Mac OS and Linux systems come preconfigured with ssh. 

Some prior SQL experience will be assumed. 

Level: Overview 

A Case Study: Data Visualizations and Insight into How 
America Eats at Restaurants 

David Hwang 

This workshop will discuss the pains and lessons learned from 
building out a database that collects data from thousands of 
restaurants across North America on a nightly basis. The chal¬ 
lenges of integrating data from disparate Point of Sales systems, 
data cleaning, data normalization, ETL and generating insight 
will all be discussed. 

Additional topics include: 

• Why choose open source? 

• Why pick Mongo? 

• What’s up with ZFS? 

We’ll wrap up by covering lessons learned and what’s next 

Level: Overview 





10 * April 8-10,2013 * Boston, MA * www.BigDataTechCon.com 

Workshops Monday, April 8 


BigData 

— TECHC0N 



Hands-on Google BigQuery and Google Predictive API MS 

Lynn Langit 

In this workshop you will learn the what, why, how, when and 
where around two of Google’s cloud data offerings. Here you will 
learn about Google BigQuery and the Google Predictive API and 
why you might consider using either of these cloud services. You 
will learn how to write effective, performant and useful queries 
when using either of these services. You’ll also get to wire up the 
services via their exposed APIs. The course will be taught in Java. 
The workshop’s objectives include: 

• Understanding the capabilities (and limitations) of Google 
BigQuery and the Google Predictive API 

• Writing and executing queries against BigQuery and the Pre¬ 
dictive API 

• Understanding how to wire up the services into your appli¬ 
cation 

• Understanding costs involved in using these cloud services 
from Google 

Prerequisite knowledge: RDBMS administration or develop¬ 
ment. Data-mining experience is helpful but not required. You 
should know Java. 

Prerequisite software: You must have a Gmail account to sign 
up for these services from Google. 

Level: Intermediate 


• Why do the syntax of Pig, Hive and Cascading differ, and 
what are each trying to emphasize or de-emphasize? 

• Running Exercises: Practice HiveQL, Pig and 
Cascading/Cascalog/Scalding concepts as they are intro¬ 
duced. 

• When to use Pig, Hive or Cascading over another 

• Code and examples of each language 

The following prerequisites ensure that you will gain the maxi¬ 
mum benefit from the class: 

• Programming experience: This is a developer’s course. We 
will write Hive, Pig, Cascading/Scalding/Cascalog applica¬ 
tions. Prior programming experience is recommended. 

• Linux shell experience: Basic Linux shell (bash) commands 
will be used extensively. Some prior experience is recom¬ 
mended. 

• Experience with SQL databases: SQL experience is helpful 
for learning these languages, but not essential. 

The main format of this workshop will be follow-along, al¬ 
though all code will be provided in case you want to code simul¬ 
taneously. You may log into remote EMR instances to build, test 
and run the applications if you wish, but it will be on your ac¬ 
count, and the instructor may not be able to help you with debug¬ 
ging. You will also be provided with all the exercise software, so 
you can view it on your laptop if desired. 

Level: Advanced 


Hive, Pig, Cascading and Codd: A Crash Course in 
Map/Reduce Relational Languages via an 
Appeal to History EM 

Daniel Eklund 

This workshop will teach you how to simultaneously imple¬ 
ment the relational operators as defined by famed computer sci¬ 
entist E.L Codd in Hive, Pig and Cascading. You, the developer, 
will focus on the abstract concepts of the Relational Algebra in 
order to learn ALL the languages simultaneously. Theory can 
sometimes be dry, but, in this case, revisiting Codd’s original in¬ 
tentions — and his seminal papers —can accelerate our learning 
of these new Big Data Relational Languages: 

• What are the Relational Algebra operators? Select, project, 
join, cross, union, intersect and divide 

• What is the Pig, Hive and Cascading syntax for these operators? 

• How do high-level languages and libraries like Pig and Hive 
compile these operators to Map/Reduce? 


Programming with Scalding & Algebird EM 

Krishnan Raman 

This is a hands-on coding workshop. We will code up a few 
Scalding programs in different domains: portfolio optimization, 
healthcare, cosine similarity and random forests. While Scalding 
looks like a thin Scala API atop Cascading, this appearance is de¬ 
ceptive. The power of Scala combined with the mapping, group¬ 
ing and joining primitives in Scalding, along with the Algebird 
abstract algebra library, allow for a whole new level of flexibility 
with Big Data. Matrix operations in Scalding are powered by Alge¬ 
bird, and using large dimension matrices as a primitive, we can 
tackle problems in diverse domains that employ linear algebra 
over very large datasets in a batch mode. 

Note: You are expected to have installed Scala, Scalding and Al¬ 
gebird on your laptop before the workshop commences. 

Level: Advanced 
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Tuesday, April 9 
8:30 am -9:30 am 

Visualizing Your Graph EM 

Max De Marzi 

Attend this class to learn how to see graphs conceptually and 
visually. Learn to think of your domain as a graph, and visualize it. 
See just one node, a handful of relationships, a sub-graph, or 
everything all at once. You’ll learn how to create a pretty picture of 
your friends’ friendships, see clusters of nodes congregate, and 
get lost in a 3D graph in this class. 

Level: Overview 

Building an Impenetrable ZooKeeper iH 

Kathleen Ting 

Apache ZooKeeper is a project that provides reliable and 
timely coordination of processes. Given the many cluster re¬ 
sources leveraged by distributed ZooKeeper, it’s frequently the 
first to notice issues affecting cluster health, which explains its 
moniker: “The canary in the Hadoop coal mine.” 

Come to this class and you will learn: 

• How to configure ZooKeeper reliably 

• How to monitor ZooKeeper closely 

• How to resolve ZooKeeper errors efficiently 

Culling from the diverse environments we’ve supported, we will 
share what it takes to set up an impenetrable ZooKeeper environ¬ 
ment, what parts of your instrastructure specifically to monitor, 
and which ZooKeeper errors and alerts indicate something seri¬ 
ously amiss with your hardware, network or HBase configuration. 
Level: Intermediate 

Apache Cassandra—A Deep Dive EM 

Ben Coverston 

Recently, there has been some discussion about what Big Data 
is. The definition of Big Data continues to evolve. Along with vari¬ 
ety, volume and velocity (which the usual suspects handle well), 
other facets have been introduced, namely complexity and distri¬ 
bution. Complexity and distribution are facets that require a dif¬ 
ferent type of solution. 

While you can manually shard your data (Oracle, MySQL) or 
extend the master-slave paradigm to handle data distribution, a 
modern big data solution should solve the problem of distribu¬ 
tion in a straightforward and elegant manner without manual in¬ 
tervention or external sharding. Apache Cassandra was designed 
to solve the problem of data distribution. It remains the best data¬ 
base for low latency access to large volumes of data while still al¬ 
lowing for multi-region replication. We will discuss how 
Cassandra solves the problem of data distribution and availability 
at scale. 

This class will cover: 

• Replication • Data Partitioning 

• Local Storage Model • The Write Path 

• The Read Path • Multi-Data-Center Deployments 
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• Upcoming Features (1.2 and beyond) 

For the most benefit from this class, attend the “Getting Started 
with Cassandra” workshop. 

Level: Intermediate 


Taming Elephants, Bees and Pigs - The Big Data Circus 

Asish Thusoo 

This class will discuss the reasons and motivations behind the 
Big Data revolution and how it has evolved from previous data- 
processing technologies. Hadoop’s technical advantages and em¬ 
phasis on scale over raw performance is primarily driven by the 
growth in variety of data sources, for example. 

Based on real-world experience while at Facebook, the instruc¬ 
tor will talk about some of the key challenges of scale and the evo¬ 
lution of these technologies out of necessity, starting with 
Hadoop, expansion with SQL on top, and adding microstrategy 
and business intelligence layers. This class will cover specific is¬ 
sues and solutions at Facebook, such as latency gaps in the infra¬ 
structure solved by caching results in the MySQL tier, and the 
investment made to build low-latency query engines on HDFS. 
This will lead to a discussion of business demands and the tech¬ 
nical responses. 

Finally, we will discuss the future of Big Data and how technol¬ 
ogy is continuing to simplify the process and become accessible 
for all. We are able to address noise in the data and use the cloud 
to simplify what to use and what not to use by hiding these tech¬ 
nologies behind a comprehensive data platform. 

Attend this class to learn about the issues that were encountered 
in the trenches at Facebook. They were addressed by the trailblaz- 
ers and can now be handled even by smaller, leaner companies. 
Level: Intermediate 


Distributed Search and Real-Time Analytics, Part I EH 

Jason Rutherglen and Ryan Tabora 

In this hands-on, two-part class, you will learn the importance 
of distributed search from the instructors’ industry experience 
and knowledge of real-world use cases. We’ll introduce different 
architectures that incorporate distributed search techniques, and 
share pain points experienced and lessons learned. Building on 
that, we’ll depict the landscape of distributed search tools and 
their future directions. 

For the hands-on part, you will learn how to install and use 
Apache Solr for real-time Big Data analytics, search, and reporting 
on popular NoSQL databases. You’ll also learn some tricks of the 
trade and how to handle known issues. 

We will spend around 30 minutes providing some background 
and use-case information on distributed search. We’ll then go 
through finer technical details with exercises every 10 minutes or 
so. We will go through the code on stage, but you can follow along 
with the EC2 instances that will be set up. 

Prerequisites: You should be familiar with Java and Unix shell 
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Tuesday, April 9 

commands. Some prior familiarity with SQL and Big Data solu¬ 
tions like Hadoop/Hbase/Cassandra/Solr would be helpful but 
not required. 

Level: Intermediate 

How to Integrate Structured and Unstructured Data 
withAvro EH 

Serge Blazhievsky 

This class is designed to demonstrate how to use the Apache 
Avro Java serialization libraries with Hadoop frameworks to speed 
up integration of big volumes of unstructured data sets. 

The class will teach you about the Avro framework. You will 
learn about: 

• Avro schema definitions and data types 

• Avro record creation 

• How to create Avro schemas programmatically 

• How to sort records by setting a property in schema 

• How to read records from a file 

• Hadoop Map/Reduce integration with Avro data 

• Advantages of using Avro data over flat files or map files 

• Specifics of the integrations with Mapper and Reducer code 

• Avro format for Map/Reduce result output 

• Cascading Map/Reduce jobs 

• Map/Reduce for converting flat files into Avro data 

We’ll use two real-life examples to demonstrate the advantage 
of using Avro versus regular files. 

Level: Advanced 

Data Modeling and Relational Analysis in a NoSQL World EH 

Michael Miller 

The new wave of NoSQL technology is built to provide the flex¬ 
ibility and scalability required by agile Web, mobile and enter¬ 
prise applications. Interestingly, any system that supports 
chained Map/Reduce processing (specifically Map/Reduce-Map) 
fulfills the basic query requirements of a SQL engine. Therefore, 
we will work to help you bridge the gap between SQL, relational 
(big) data, and the brave new world of NoSQL. 

In this class, you will learn how to model real-world relational 
data in a modern document database. We next go on to compile 
various SQL operations (SELECT, SUM, AVG, JOIN, etc.) into ex¬ 
ceptionally simple Map/Reduce programs. We finish with a study 
demonstrating the performance, scalability and “time-to-value” 
benefits of this approach, specifically the pre-computation of ma¬ 
terialized views. 

The class will be a mix of chalkboard and interactive demon¬ 
strations. 

Prerequisites: Bring a laptop with a modern Web browser 
(Chrome, Safari or Firefox). Previous experience with basic script¬ 
ing languages (e.g. JavaScript) is an advantage but not a require¬ 
ment. All data and code samples will be provided at the beginning 
of the class. 

Level: Advanced 


11:00 am -12:00 pm 

Beyond Map/Reduce 

Dean Wampler 

Apache Hadoop is the current darling of the Big Data world. At 
its core is the Map/Reduce computing model for decomposing 
large data-analysis jobs into smaller tasks, and distributing those 
tasks around a cluster. Map/Reduce itself was pioneered at Google 
for indexing the Web and other computations over massive 
datasets. 

The strengths of Map/Reduce are cost-effective scalability and 
relative maturity. Its weaknesses are its batch orientation, making 
it unsuitable for real-time event processing, and its difficulty of 
implementing data-analysis idioms in the Map/Reduce comput¬ 
ing model. 

We can address the weaknesses in several ways. First, higher- 
level programming languages, which provide common query and 
manipulation abstractions, make it easier to implement Map/Re¬ 
duce programs. However, longer term, we need new distributed 
computing models that are more flexible for different problems 
and which provide better real-time performance. 

Well review these strengths and weaknesses of Map/Reduce 
and the Hadoop implementation, then discuss several emerging 
alternatives, such as Cloudera’s Impala system for analytics, 
Google’s Pregel system for graph processing, and Storm for event 
processing. Well finish with some speculation about the longer- 
term future of Big Data. 

This class is good for developers, data analysts and managers, 
but people with Hadoop and/or programming experience will get 
the most out of it. 

Level: Overview 

Agile Analytics Applications with Hadoop EM 

Russell Jurney 

Attend this class to learn about rapidly prototyping analytics 
applications using the Hadoop stack to foster agile development 
in light of the ever-deepening analytics stack. The presentation 
uses Hadoop, Pig, NoSQL stores and lightweight Web frameworks 
to rapidly connect end users to real insights. Topics include writ¬ 
ing User Defined Functions in Pig, and publishing insights mined 
on Hadoop to NoSQL stores like MongoDB, HBase and Cassan¬ 
dra. You will also learn how to use Python, Ruby, JRuby and 
Node.js to present data mined on Hadoop in a browser. 

Level: Intermediate 

Hands-on MongoDB EM 

Lynn Langit 

Come to this class to learn how to use MongoDB, one of the 
most popular NoSQL databases. Together we will install and use 
MongoDB so that you can start understanding which business 
scenarios are a fit for this type of data storage. After setup, you’ll 
learn basic administration, such as data inserts, indexing and 
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more. Then you’ll get experience with the various ways to query a 
MongoDB database, using GUI tools such as MongoVUE, via the 
native query language, Map/Reduce, and the recently released 
aggregation framework. After this class, you will know how to use 
MongoDB in your projects. 

Prerequisite knowledge: some RDBMS admin or development 
is helpful, but not required. 

Setup software: The class will be taught on Macs, but you can 
implement on Windows if you wish. 

Level: Intermediate 

Analytics Maturity Model EH 

John A. De Goes 

Every company is at a different stage in leveraging analytics to 
improve their operational efficiency and product offerings. In this 
class, you will learn an eight-stage analytics maturity model that 
companies can use to determine how far they are from the most 
analytical companies. 

Level: Intermediate 

Distributed Search and Real-Time Analytics, Part II EH 

Jason Rutherglen and Ryan Tabora 

In this hands-on, two-part class, you will learn the importance 
of distributed search from the instructors’ industry experience 
and knowledge of real-world use cases. We’ll introduce different 
architectures that incorporate distributed search techniques, and 
share pain points experienced and lessons learned. Building on 
that, we’ll depict the landscape of distributed search tools and 
their future directions. 

For the hands-on part, you will learn how to install and use 
Apache Solr for real-time Big Data analytics, search, and reporting 
on popular NoSQL databases. You’ll also learn some tricks of the 
trade and how to handle known issues. 

We will spend around 30 minutes providing some background 
and use-case information on distributed search. We’ll then go 
through finer technical details with exercises every 10 minutes or 
so. We will go through the code on stage, but you can follow along 
with the EC2 instances that will be set up. 

Prerequisites: You should be familiar with Java and Unix shell 
commands. Some prior familiarity with SQL and Big Data solu¬ 
tions like Hadoop/Hbase/Cassandra/Solr would be helpful but 
not required. 

Level: Intermediate 

Big Data Science: Extracting Truth from Large, 
Multi-Structured Data, Part I 

Dan Mallinger 

Come to this class for an overview of Big Data Science from 
planning to execution. We will start by covering the processes and 
best practices for successfully doing Data Science in a business 
setting. We will then review a use case and walk through the ex¬ 
ploratory to confirmatory modeling stages. Each use case in- 
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eludes source code in R and Pig, with the goal of showing how to 
parallelize analysis in R over Hadoop. 

The use case will highlight the advantages (and difficulties) of 
working with multi-structured data through text analysis, linear 
models and more. A theme throughout will be the importance of 
triangulating around truth when problems are intractable. 

The following prerequisites ensure that you will gain the maxi¬ 
mum benefit from the class: 

Programming experience: Big Data is still the Wild West of 
technologies, and programming skills are required to wrangle and 
analyze Big Data. 

Analysis experience: Although this is not a statistics course, 
understanding the principles of analysis as well as research meth¬ 
ods will help in reapplying the lessons. 

Level: Advanced 

Getting Started with R and Hadoop, Part I EH 

Jeffrey Breen 

Increasingly viewed as the lingua franca of statistics, R is a nat¬ 
ural choice for many data scientists seeking to perform Big Data 
analytics. And with Hadoop Streaming, the formerly Java-only Big 
Data system is now open to nearly any programming or scripting 
language. This two-part class will teach you options for working 
with Hadoop and R before focusing on the RMR package from the 
RHadoop project. We will cover the basics of downloading and in¬ 
stalling RMR, and will test our installation and demonstrate its 
use by walking through three examples in depth. 

You will learn the basics of applying the Map/Reduce para¬ 
digm to your analysis and how to write mappers, reducers and 
combiners using R. We will submit jobs to the Hadoop cluster and 
retrieve results from the HDFS. We will explore the interaction of 
the Hadoop infrastructure with your code by tracing the input 
and output data for each step. Examples will include the canoni¬ 
cal “word-count” example, as well as the analysis of structured 
data from the airline industry. 

No specific prerequisite knowledge is required, but a familiar¬ 
ity with R and Hadoop or Map/Reduce is helpful. 

Level: Advanced 
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Untangling the Relationship Hairball with a 
Graph Database EM 

Max De Marzi 

Not only has data gotten bigger, it’s gotten more connected. 

Make sense of it all and discover what these Big Data connections 
can tell you about your users and your business. Come to this 
class to learn some of the different use cases for graph databases, 
and how to spot the non-obvious opportunities in your data. 

Level: Overview 

Mastering Sqoop for Data Transfer for Big Data MS 

Jaroslav Cecho and Kathleen Ting 

Apache Sqoop is a tool for efficiently transferring bulk data be¬ 
tween Hadoop-related systems (such as HDFS, Hive and HBase) 
and structured data stores (such as relational databases, data 
warehouses and NoSQL systems). 

Even though Sqoop works very well in production environ¬ 
ments, come to this class to learn how to handle bulk-transfer 
challenges that many Sqoop users face. 

Also come to this class to learn about Sqoop 2, the next genera¬ 
tion of the tool which is currently in development! Sqoop 2 will ad¬ 
dress ease of use, ease of extension and security. Well talk about 
Sqoop 2 from both the development and operations perspectives. 
Level: Intermediate 

Hadoop Programming with Scalding EES 

Dean Wampler 

Scalding is a Scala API for writing advanced data workflows for 
Hadoop. Unlike low-level APIs, it provides intuitive pipes and filters 
idioms, while hiding the complexities of Map/Reduce program¬ 
ming. Scalding wraps the Java-based Cascading framework in Func¬ 
tional Programming concepts that are ideal for data problems, 
especially the mathematical algorithms for machine learning. 

This class will use examples to demonstrate these points. De¬ 
velopers will see that Scalding is an ideal tool when they need a 
more full-featured and flexible tool set for Big Data applications, 
beyond what Hive or Pig can provide. 

Level: Intermediate 

Real-Time Hadoop EES 

Boris Lublinsky and Michael Segel 

We will start with a short introduction of different approaches 
to using Hadoop in the real-time environment, including real¬ 
time queries, streaming and real-time data processing and deliv¬ 
ery. Then well describe the most common use cases for real-time 
queries and products implementing these capabilities. We will 
also describe the role of streaming, common use cases, and prod¬ 
ucts in the space. 

The majority of time will be dedicated to the usage of HBase as 
a foundation for the real-time data process. We describe several 
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architectures for such implementation, as well as a high-level de¬ 
sign and implementation for two examples: a system for storing 
and retrieving images, and using HBase as a back end for Lucene. 

Level: Intermediate 

Getting Started with Predictive Modeling: Simple Models 
and Basic Evaluation 

Claudia Perlich 

This class presents the basic steps of building a predictive 
model, and encourages you to get your feet wet on a small prob¬ 
lem using Excel and Weka to build a model. 

You will learn about selection training examples, constructing 
an appropriate feature vector, some basic feature selection, 
building models using three to four different machine-learning 
techniques (logistic regression, decision trees and rule induc¬ 
tion), and evaluating the resulting models using cross validation 
and a separate test set. 

Level: Intermediate 

Big Data Science: Extracting Truth from Large, 
Multi-Structured Data, Part II 

Dan Mallinger 

Come to this class for an overview of Big Data Science from 
planning to execution. We will start by covering the processes and 
best practices for successfully doing Data Science in a business 
setting. We will then review a use case and walk through the ex¬ 
ploratory to confirmatory modeling stages. Each use case in¬ 
cludes source code in R and Pig, with the goal of showing how to 
parallelize analysis in R over Hadoop. 

The use case will highlight the advantages (and difficulties) of 
working with multi-structured data through text analysis, linear 
models and more. A theme throughout will be the importance of 
triangulating around truth when problems are intractable. 

The following prerequisites ensure that you will gain the maxi¬ 
mum benefit from the class: 

Programming experience: Big Data is still the Wild West of 
technologies, and programming skills are required to wrangle and 
analyze Big Data. 

Analysis experience: Although this is not a statistics course, 
understanding the principles of analysis as well as research meth¬ 
ods will help in reapplying the lessons. 

Level: Advanced 

Getting Started with R and Hadoop, Part II EES 

Jeffrey Breen 

Increasingly viewed as the lingua franca of statistics, R is a nat¬ 
ural choice for many data scientists seeking to perform Big Data 
analytics. And with Hadoop Streaming, the formerly Java-only Big 
Data system is now open to nearly any programming or scripting 
language. This two-part class will teach you options for working 
with Hadoop and R before focusing on the RMR package from the 
RHadoop project. We will cover the basics of downloading and in- 
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stalling RMR, and will test our installation and demonstrate its 
use by walking through three examples in depth. 

You will learn the basics of applying the Map/Reduce para¬ 
digm to your analysis and how to write mappers, reducers and 
combiners using R. We will submit jobs to the Hadoop cluster and 
retrieve results from the HDFS. We will explore the interaction of 
the Hadoop infrastructure with your code by tracing the input 
and output data for each step. Examples will include the canoni¬ 
cal “word-count” example, as well as the analysis of structured 
data from the airline industry. 

No specific prerequisite knowledge is required, but a familiar¬ 
ity with R and Hadoop or Map/Reduce is helpful. 

Level: Advanced 

3:15 pm -4:15 pm 

How to See and Understand Big Data 

Jock Mackinlay 

Visual analysis is an iterative process that exploits the power of 
the human visual system to help people work with all kinds of 
data. When data is big, people must overcome the challenges of 
wide data, tall data, and data from multiple sources, often coming 
in fast and furiously. Attend this class to learn how people work¬ 
ing with data can address these challenges. The key technique is 
to use multiple coordinated views of data during visual analysis 
and storytelling with data. 

You’ll learn: 

• What research and practice have taught us about designing 
great visualizations and dashboards. 

• Fundamental principles for designing effective coordinated 
views for yourself and others. 

• How to systematically analyze data from multiple databases 
using your visual system. 

The instructor works for Tableau Software, a provider of data 
visualization solutions. 

Level: Overview 

Seven Deadly Hadoop Misconfigurations MB 

Kathleen Ting 

Misconfigurations and bugs break the most Hadoop clusters. 
Fixing misconfigurations is up to you! Attend this session to learn 
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how to get your Hadoop configuration right the first time. In 
some support contexts, a handful of common issues account for a 
large fraction of issues. That is not the case for Hadoop, where 
even the most common specific issues account for no more than 
2% of support cases. Hadoop errors show up far from where you 
configured, making it hard to know what log files to analyze. It 
pays to be proactive. Come to this class! 

Level: Intermediate 

Hands-on Hadoop for Developers—Understanding 
Map/Reduce MB 

Lynn Langit 

In this class, you’ll get hands-on experience working with a 
leading implementation of Hadoop: Cloudera’s CDH4 release. 
Well work with their the company’s virtual machine and concen¬ 
trate on learning how to get business value from a Hadoop cluster 
of data by writing increasingly complex queries. These queries are 
known as Map/Reduce jobs, and we will write them in Java. Even 
if you aren’t a Java pro, you can still get value from this class, as we 
will also work with higher-level languages, such as Hive’s HQL 
and Pig. You’ll finish this class understanding when and how to 
get value from a Hadoop cluster. 

Prerequisite knowledge: Database development using T-SQL; 
Java or C# helpful, but not required. 

Prerequisite software: Cloudera Hadoop VM v CDH4. A link 
will be provided for download. 

Level: Intermediate 

The Dark Sides of Predictive Modeling: 

Tricks and Pitfalls EES 

Claudia Perlich 

This class will teach advanced strategies to improve model per¬ 
formance, including multi-state modeling and advanced feature 
engineering. In addition, we will look at some of the most com¬ 
mon pitfalls: over-fitting and leakage. 

With the growing set of tools available for data storage, man¬ 
agement, consolidation and preprocessing, the art of exploratory 
data analysis is getting lost. Too often are we trying to make sense 
of data that is “cleaned” beyond recognition. It makes a major dif¬ 
ference whether missing values have been removed or replaced 
by something else. 

In the end, data analysts work with data that are not what they 
think they are; they are unaware of sampling biases or informa¬ 
tion “from the future” that really should not be allowed in the 
model. As a result, the models look initially very good, and fail 
when used in reality. We will also cover some strategies of manag¬ 
ing really large amounts of data, including strategic sampling and 
grouping examples. 

Prerequisites: You should have some experience in predictive 
modeling. 

Level: Advanced 
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NoSQL for SQL Professionals 

Dipti Borkar 

With all of the buzz around Big Data and NoSQL (non-rela¬ 
tional) database technology, what actually matters for today’s SQL 
professional? Learn more in this talk about Big Data and NoSQL 
in the context of the SQL world, and get to what’s truly important 
for data professionals today 

In this presentation you’ll learn: 

• The main characteristics of NoSQL databases 

• Differences between distributed NoSQL and relational data¬ 
bases 

• Use cases for NoSQL technologies, with real-world examples 
from organizations in production today 

Level: Intermediate 

Hadoop by Example EM 

Serge Blazhievsky 

This class is designed to demonstrate the most commonly 
used Map/Reduce design patterns for various problems. Perform¬ 
ance and scalability will be taken into consideration. 

The class will present a general overview of the problems that 
can be solved using Map/Reduce, scalability and performance 
tuning for clusters of different sizes. The techniques described 
here can be used on all Hadoop distributions. 

The following technical problems will be covered: 

• “Hello world!” of the Map/Reduce universe—a word-count 
example 

• Mapping only Map/Reduce jobs and their usage for ETL-type 
jobs 

• Global sorting techniques 

• Sequencing files and its usage in Map/Reduce jobs 

• Mapping files and its usage in Map/Reduce jobs 

• Reduce-side join and its advantages and limitations 

• Map-side join and its advantages and limitations 

Each technique will be provided with a code example that can 
be used as a template. No prior knowledge about the topic is re¬ 
quired; however, some Java knowledge is recommended. 

Level: Intermediate 
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In-Database Predictive Analytics EM 

John A. De Goes 

Predictive analytics have long lived in the domain of statistical 
tools like R. Increasingly, however, as companies struggle to deal 
with exploding volumes of data not easily analyzed by small data 
tools, they are looking at ways of doing predictive analytics di¬ 
rectly inside the primary data store. 

This approach, called in-database predictive analytics, elimi¬ 
nates the need to sample data and perform a separate ETL 
process into a statistical tool, which can decrease total cost, im¬ 
prove the quality of predictive models, and dramatically shorten 
development time. In this class, you will learn the pros and cons 
of doing in-database predictive analytics, highlights of its limita¬ 
tions, and survey the tools and technologies necessary to head 
down the path. 

Level: Advanced 

Wednesday, April 10 

8:45 am-9:45 am 

Building Successful Data Science Teams 

Dan Mallinger 

Come to this class and join a lively conversation on Big Analyt¬ 
ics. We’ll discuss current trends in the field, including the new role 
of Data Science beyond analytics, and how companies are em¬ 
bracing new technologies and approaches. 

We will frame the discussion with a client use case, covering 
objectives, approaches and challenges encountered. Through this 
lens, we’ll explore how organizations and individuals can transi¬ 
tioning skills into the Big Data space from more traditional roles, 
hiring strategies and team structures. Finally, we will discuss the 
patterns that define a mature Data Science team and how firms 
will grow to assess and evaluate these teams. 

While our conversation will be technology-agnostic, we will 
focus particular attention to data science within the Hadoop 
ecosystem. 

What you’ll learn: 

• Best practices in Data Science projects 

• Common technologies and tools 

• Training and hiring strategies 

• Best team structures 

• The current state of the field 

• KPIs that define a mature Data Science practice 
Level: Overview 
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Building Applications Using HBase mu 

Amandeep Khurana 

Apache HBase is one of the popular new-age scalable NoSQL 
databases, and has seen a significant increase in production use 
cases over the last couple of years. HBase is an open-source ver¬ 
sion of Google BigTable. 

This class will teach you the basics of HBase, including its ar¬ 
chitecture, data model, and how HBase is different from tradi¬ 
tional database systems in terms of design assumptions and also 
with respect to building applications. 

Level: Intermediate 

Oozie: A Workflow Scheduler for Hadoop WM 

Boris Lublinsky 

This class will start with a discussion of Oozie’s role in the 
Apache Hadoop ecosystem and its relationship to other Hadoop 
platform components. We will then describe the most straightfor¬ 
ward use cases/examples where Oozie can be helpful or even 
necessary. From that we move to Oozie workflow specification: 
components, illustrated with examples for various Oozie actions 
(Java, Map/Reduce, Hive, Pig, Sqoop). 

Then we will describe Oozie architecture. We will present the 
main Oozie components, job life cycle, retry and recovery. We will 
demonstrate the Oozie management console in integration/com¬ 
plementary usage with other Hadoop GUI tools (Map/Reduce Ad¬ 
ministration, Task Tracker, NameNode viewer, Fair Scheduler 
Administration, and Log File View). 

We will then discuss Oozie job parameterization, expression 
language, job configuration, runtime artifacts placement, and job 
submission using the Oozie command-line utility (CLI). We will 
demonstrate how to check statuses or stop Oozie jobs from CLI. 

We will also briefly describe Oozie APIs (Java and REST), and pro¬ 
vide fragments of code. After that, we will present Oozie coordi¬ 
nator and discuss how Oozie allows expressing dependencies 
between jobs and groups of jobs using “Synchronous Datasets.” 

We will touch on Oozie SLA support. 

Then we will describe Oozie bundles and demonstrate the 
whole hierarchy of Oozie artifacts: from bundles to coordinator to 
workflows to actions to processes running on Hadoop clusters. In 
conclusion, we will discuss Oozie limitations and ways to over¬ 
come some of them (via extension and customization). We will 
also talk about new features in future versions. 

Level: Intermediate 

Extending Your Data Infrastructure with Hadoop EE23 

Jonathan Seidman 

Hadoop provides significant value when integrated with an ex¬ 
isting data infrastructure, but even among Hadoop experts there’s 
still confusion about options for data integration and business in¬ 
telligence with Hadoop. This class will help clear up the confu¬ 
sion. You will learn: 
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• How can I use Hadoop to complement and extend my data 
infrastructure? 

• How can Hadoop complement my data warehouse? 

•What are the capabilities and limitations of available tools? 

• How do I get data into and out of Hadoop? 

• How can I use my existing data-integration and 
business-intelligence tools with Hadoop? 

• How can I use Hadoop to make my ETL processing more 
scalable and agile? 

We’ll illustrate this with an end-to-end example data flow using 
open-source and commercial tools, showing how data can be im¬ 
ported and exported with Hadoop, ETL processing in Hadoop, 
and reporting and visualization of data in Hadoop. You will also 
learn recent advancements that make Hadoop an even more 
powerful platform for data processing and analysis. 

Level: Intermediate 

Matrix Methods with Hadoop EES 

David Gleich 

Get a brief introduction to thinking about data problems as 
matrices, and then learn how to implement many of these algo¬ 
rithms in the Hadoop streaming framework. The data-as-matrix 
paradigm has had a rich history, and the point of this talk is to 
give folks some idea of which statistical algorithms are likely to be 
reasonably efficient in Map/Reduce, and which are probably not 
going to be so reasonable. This will involve a few ideas: 

• How to store matrix data, and the performance tradeoffs. 

The idea is to take natural ways of looking at methods to 
store data and describe them as a way of storing a matrix. 
This gives some insight into how a method could be fast. 

• How to implement some basic matrix operations that form 
the basis of many numerical and statistical algorithms. 

• Problems we’ve run into working with matrix data, and how 
we’ve solved some of them. 

• Ideas for future platforms that are ideal for this case. 

• A concern about the numerical accuracy of Big Data. Most of 
the code samples will use Python interfaces for Hadoop 
streaming, such as Dumbo, mrjob or Hadoopy. 

Prerequisites: 

• You ought to have a rough handle on what a matrix represents. 

• Those with a linear algebra background will probably get 
more out of this talk, but I’ll explain any linear algebra prop¬ 
erty with a statistical or data-oriented analog. 

• You should know about how to use Map/Reduce or Hadoop, 
most of the code examples will use Hadoop streaming via 
Dumbo. 

Level: Advanced 
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Microsoft’s Big Data Story 

Lynn Langit 

Big Data is hot, but with so much press covering open-source 
projects, what does Microsoft have to offer in this area? More than 
you might expect! From extensions to core products, such as the 
addition of columnstore indexes in SQL Server 2012, to entirely 
new products, such as Data Explorer, you will learn about the 
“Microsoftification” of current Big Data offerings, such as 
“Hadoop on Azure.” You’ll leave this class understanding how you 
can best leverage your existing investment in Microsoft technolo¬ 
gies to take advantage of Big Data opportunities. 

Level: Overview 

Setting up a Neo4j Graph Database Cluster 
on Amazon EC2 EE3 

Max De Marzi 

Learn how to set up a Neo4j Enterprise Cluster on Amazon EC2 
to handle your very connected Big Data. Hints, tips and best prac¬ 
tices will help you get started with a proof of concept or full pro¬ 
duction system. The instructor works for Neo Technology, 
creators of the Neo4j Graph Database. 

Level: Intermediate 

HBase Schema and Table Design Principles EES 

Amandeep Khurana 

More and more companies are adopting the open-source 
Apache HBase as the back-end database for Big Data applications 
that have very large tables - billions f rows, millions of columns. 
Come to this class to learn how to design tables to leverage the 
design principles and features of HBase. 

HBase tables are very different from relational database tables, 
and there are several concepts at play that you need to keep in 
mind while designing them. This class will introduce you to the 
basics of HBase schema and table design and how to think about 
tables when working with this popular database. 

Level: Intermediate 

How to Fit a Petabyte in Apache HBase EES 

Jean-Daniel Cryans 

There are many ways to load a petabyte of data in HBase, and 
this class will show you the best approaches! We will first review 
the solutions that are commonly adopted instinctively, and show 
why they fail. This will help you understand the practicalities of 
HBase’s architecture. 

The first technique that will be taught is better schema de¬ 
signs—that is, how to create keys that won’t inflate the size of your 
data set. The second technique that will be presented is better 
management of the loading of the data through pre-splitting of 
the regions and tuning the cluster for that type of workload. 

Finally, we’ll show you how to use bulk loading will be dis- 
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cussed as the best way to populate the database. The code that 
will be used for the demonstrations will be available on GitHub. 

Level: Advanced 

Hadoop Backup and Disaster Recovery 101 EES 

Jairam Ranganathan 

Any production-level implementation of Hadoop must have its 
data protected from threats. Threats to data integrity can be 
human-generated (malicious/unintentional) or site-level (power 
outage, flood, etc.). As soon as you start to identify these threats, 
it’s important to develop a backup or disaster-recovery solution 
for Hadoop! 

In this class, you will learn the unique considerations for 
Hadoop backup and disaster recovery, as well as how to navigate 
the common issues that arise when architects and developers 
look to protect the data. 

We’ll cover: 

• How to model your backup / disaster-recovery solution, consid¬ 
ering your threat model and specifics around data integrity, 
business continuity, and load balancing. 

• Best practices and recommendations, highlighting Hadoop in 
contrast to traditional SAN/DB systems; replication versus 
"teeing" models for ensuring DR; replication scheduling; Hive; 
HBase; managing bandwidth; monitoring replication; using 
one's secondary beyond replication; and a survey of existing 
tools and products that can be used for backup and DR 

After taking this class, you should be able to explain to your or¬ 
ganization the right way to effect a backup or data recovery solu¬ 
tion for Hadoop. 

Level: Intermediate 

What You Can Do with a Tall-and-Skinny QR Factorization 
on Hadoop EES 

David Gleich 

A common Big Data-style dataset is one with a large number 
(millions to billions) of samples with up to a few thousand fea¬ 
tures. One way to view these datasets from an algorithmic per¬ 
spective is to treat them as a tall-and-skinny matrix. 

In this class, you will learn one of the most widely used matrix 
decomposition techniques: the QR factorization. 

First, you’ll see how to use the QR factorization to solve various 
statistical analysis problems on datasets with many samples. For in¬ 
stance, we’ll see how to compute a linear regression for such a prob¬ 
lem, how to compute principal components, and how easily we can 
implement a new scalable Kernel K-Means clustering method. 

Next, you will learn how to compute these QR factorizations in 
Hadoop. We use an algorithm that maps beautifully onto the 
Map/Reduce distributed computational engine. It boils down to 
computing independent QR factorizations in each mapper and re¬ 
ducer stage. One challenge we faced was in improving the numeri¬ 
cal stability of the routine in order to get a good estimate of the 
orthogonal matrix factor, which is most important for applications 
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that need a tall-and-skinny matrix singular value decomposition. 
Prerequisites: 

• You ought to have a rough handle on what a matrix repre¬ 
sents. 

• Those with a linear algebra background will probably get 
more out of this talk, but Ill provide any linear algebra 
property with a statistical or data-oriented analog. 

• You should know about how to use Map/Reduce or Hadoop; 
most of the code examples will use Hadoop streaming via 
Dumbo in Python. 

• If you attend my other talk, “Matrix Methods with Hadoop,” 
youll get more out of this class, but it isn’t required. 

• If you are familiar with linear regression, principal compo¬ 
nents or Kernel matrices, youll get more out of this class. 

Level: Advanced 

High-Speed Data Ingestion with Sharded 
NewSQL Databases EM 

Ryan Betts 

Websites and back-end data applications have grown to the 
point that the data coming in is too fast for traditional databases 
without causing significant latencies for the clients. In this ses¬ 
sion, we will look at how to take advantage of NewSQL sharding 
to consume data as fast as possible, and then extract it to a tradi¬ 
tional database in a transactionally secure manner. 

Well combine traditional slides with real code, demonstrating 
how to quickly import and extract from a high-speed NewSQL 
database to a slower-speed traditional SQL database. The instruc¬ 
tor works for VoltDB, which use the NewSQL term to describe its 
VoltDB database. 

Level: Advanced 

1:45 pm -2:45 pm 

Implementing a Simple Mongo Application 

Andrew C. Oliver 

One simple app, three different technologies deployed locally 
to two different clouds. The application is executed, the source is 
examined, the approach is compared, the tools are demonstrated, 
and your questions are answered. 

Level: Overview 

Implementing a Real-Time Data Platform Using HBase ES3 

Ravi Veeramachaneni 

Apache HBase is a distributed, column-oriented data store for 
processing large amounts of data in a scalable and cost-effective 
way. Most of the developers, designers and architects have lots of 
background or experience in working on relational databases 
using SQL; the transition to NoSQL is challenging and oftentimes 
confusing. 

Come to this class to learn what and what not to consider 
when using HBase while implementing a real-time data-manage- 
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ment system for your organization. The focus will be on sharing 
knowledge and real-world experiences to help the audience un¬ 
derstand and address the full spectrum of technical and business 
challenges. We will offer recommendations and lessons learned in 
terms of scalability, performance, reliability and transitioning to 
the new platform. 

You will learn the major considerations for infrastructure, 
schema design, implementation, deployment and tuning of HBase 
solutions. At the end of the class, you will be in a better position to 
make the right choices on applicability, design, tuning and infra¬ 
structure selection for a real-time HBase-enabled data platform. 
Level: Intermediate 

Map/Reduce Tips and Tricks MS 

Boris Lublinsky 

This class will start with a short Map/Reduce architectural re¬ 
fresher, showing how it executes in Hadoop and what the main 
Map/Reduce components and classes are, which can be used for 
customizing an execution. We will then describe the most com¬ 
mon possible Map/Reduce customizations and the reasons for 
their implementation. 

The majority of time will be dedicated to going through the 
code examples, showing how to design and implement custom 
input/output formats, readers and writers, and partitioners. We 
will also show what to expect out of any customization. 

Time permitting, we will also talk about high-level Map/Re¬ 
duce frameworks, namely Apache Crunch. 

Level: Intermediate 


Using Apache HBase’s API for Complex Real-Time 
Queries MS 

Jean-Daniel Cryans 

Apache HBase is generally viewed as an augmented key-value 
store, but what does that mean really? In this class, you will ex¬ 
plore the basic functionalities (get, put, delete and scan), and 
then fast-forward to the good parts. You will see how to use the fil¬ 
ters in order to run more efficient scans, like running lightweight 
counts over millions of rows. 

The class will also cover the different compare-and-swap (CAS) 
operations that HBase enables by being strongly consistent, like 
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Wednesday, April 10 

incrementing counters or appending data without having to 
bring the data back to the client. We’ll also cover more advanced 
features like using coprocessors. The code that will be used for 
the demonstrations will be available on GitHub. 

Level: Advanced 

Selecting the Right Big Data Tool for the Right Job, and 
Making It Work EM 

Eddie Satterly 

This class will focus on wide range of Big Data solutions from 
open-source to commercial solutions, and the specific selection 
criteria and profiles of each. As in all technology areas, each solu¬ 
tion has its own sweet spots and challenges, either in CAP theo¬ 
rem, ACID compliance, performance or scalability. This class will 
provide an overview of the technical tradeoffs for the list of solu¬ 
tions in technical terminology. Once the technical tradeoffs are 
reviewed, we will review the cost and value of open-source solu¬ 
tions versus commercial software, and the tradeoffs that folks 
must take to choose one over the other. 

The next phase will go into great detail on use cases for specific 
solutions based on real-world experience. All of the specific use 
cases have been seen firsthand from our customers. The solutions 
will be reviewed to the level of specific technical architecture and 
deployment details. This is intended for a highly technical audi¬ 
ence and will not provide any high-level material on the solutions 
discussed. It is assumed that you will have working knowledge of 
solutions such as SQL, NoSQL, distributed file systems and time 
series indexes. You should also have an understanding of CAP 
theorem, ACID compliance and general performance characteris¬ 
tics of systems. 

Level: Advanced 

3:30 pm -4:30 pm 

A Survey of Probabilistic Data Structures MS 

Jim Duey 

Big Data requires Big Resources, which cost Big Money. But if 
you only need answers that are good enough, rather than pre¬ 
cisely right, probabilistic data structures can be a way to get those 
answers with a fraction of the resources and cost. 

This class will teach you about different data structures, give 
some theory behind them, and point out some use cases. You will 
learn how those structures can be used for common tasks, includ¬ 
ing counting the unique items in a collection, determining if an 
item is in a collection, and counting the occurrences of items in a 
collection. If you want Big Data without spending Big Money, this 
class is for you! 

Level: Intermediate 
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Running Mission-Critical Applications on Hadoop 

Dave Jespersen 

This class will look at what is involved when you move Hadoop 
from a lab environment to actual deployment in production. We 
will cover the critical enterprise-grade features like data integra¬ 
tion, data protection, business continuity, and high availability, and 
discuss the ways you can accomplish this in your environment. We 
will also identify potential stumbling blocks, identify what a plat¬ 
form can or can’t provide, and help determine the scope and level 
of customization necessary to make your deployment successful. 

At the end of the class, you will better understand how to move 
Hadoop from the test bed to production deployment, what is in¬ 
volved in the process, and how to run a mission-critical Hadoop en¬ 
vironment. Where appropriate, there will be real-world examples. 
Level: Intermediate 

Visualizing Big Data in the Browser EM 

Simon Metson 

Whether for exploratory investigation or final presentation, vi¬ 
sualization and data go hand in hand. We posit that visualization 
is in fact the vehicle by which data scientists, analysts and devel¬ 
opers identify and disseminate the true value in data. Is visualiza¬ 
tion technology keeping pace with recent explosive innovations 
in data management, storage and processing? In a word: yes. 

In this class, you will learn how to create custom, agile, fly¬ 
weight in-browser applications for data analysis and visualiza¬ 
tion. We will leverage modern browser tools such as D3, Cubism 
and Crossfilter to do client-side data processing and visualization 
in concert with server-side distributed Map/Reduce processing in 
a scalable document database. 

You will create a powerful, two-tier application that is served di¬ 
rectly to the browser from a distributed database, allowing you to 
deploy on cloud-hosting providers with no server install required. 

Requirements: The class will be a mix of lecture and demonstra¬ 
tion. Come prepared with a laptop, wi-fi and a modern Web browser 
(Safari, Chrome or Firefox) and you too will make data beautiful. 
Level: Advanced 

Building High-Volume Web Applications EM 

Stefano Santoro 

Keeping a Web application fast under highly variable user load 
presents website developers with a frustrating problem: What 
happens when load spikes and the site can't scale out? This often 
requires using a variety of techniques to ensure high perform¬ 
ance. We'll cover how to manage fast, slow and very slow-moving 
data, as well as methods for ensuring fast client responses. 

Using Spring, Ehcache, jQuery, JSON and VoltDB, you will learn 
howto write high-throughput, low-latency Web applications that 
can keep up with heavy, unexpected load from the front to the 
back end. The instructor works for VoltDB. 

Level: Advanced 
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Ryan Betts 

Ryan is one of the original VoltDB developers, and he 
is currently working as VoltDB’s field CTO. He spends 
his time listening to customers and helping them to 
understand how to extract value from real-time 
high-velocity data. Ryan has been writing and designing systems 
software, ranging from telco to secure Web service appliances, for 
15 years, and he is passionate about building unique products 
that solve previously impossible problems. 

Serge Blazhievsky 

Serge is Principle Software Engineer at Nice Systems, 
and is an experienced developer and architect with a 
rich background in C++/Java and distributed systems. 
Nice Systems uses Hadoop infrastructure for various 
data-processing needs. His previous company used Hadoop in¬ 
frastructure for all reporting needs, and before that, Serge de¬ 
signed Hadoop infrastructure used for Internet crawling and 
Web-page analysis. Serge holds a Masters Degree in Computer 
Engineering from Santa Clara University. Serge is a regular con¬ 
tributor to various Hadoop conferences, including the Hadoop 
User Group at Yahoo, the creator of Hadoop. 

Dipti Borkar 

Dipti is Director of Product Management at Couch- 
base where she is responsible for the company’s flag¬ 
ship product, Couchbase Server, and works with 
customers and users to understand emerging require¬ 
ments for low-latency, scalable data stores. Dipti has deep techni¬ 
cal experience in the database industry having worked at IBM as 
a software engineer and Development Manager for the DB2 
server team, and then at MarkLogic as a Senior Product Manager. 

Oscar Boykin 

Oscar, Ph.D., is a Staff Software Engineer at Twitter. He 
is co-author of Scalding, Twitter's library for large-scale 
Hadoop programming. At Twitter, he is building sys¬ 
tems for real-time Big Data with applications to ad tar¬ 
geting. Before joining Twitter, he was an Assistant Professor at the 
University of Florida in Electrical and Computer Engineering 
where his research was focused on large-scale distributed systems. 

Jeffrey Breen 

Jeffrey is the Principal of the Think Big Academy at 
Think Big Analytics. Jeffrey has been very active in 
local user groups, has taught and mentored through¬ 
out his career, and has presented talks recently on R 
and Hadoop to the Data Warehouse Institute, the Chicago Area 
Hadoop and R User groups, and the Boston Predictive Analytics 
Meetup. Jeffrey has also developed and delivered the RHadoop 
training course, as well as all materials for Revolution Analytics. 
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Jaroslav Cecho 

Jaroslav works as a Software Engineer at Cloudera, and 
is a committer and PMC member in three top-level 
Apache projects: Sqoop, Flume and MRUnit. 


Ben Coverston 

Ben currently helps coordinate the training and sup¬ 
port activities at DataStax. He has over 15 years of de¬ 
velopment experience, and has written code running 
on some of the largest travel websites in the world. He 
became interested in Big Data through his experiences in trou¬ 
bleshooting data-related problems in which the velocity and vol¬ 
ume of data exceeded the capabilities of a single machine. 



Jean-Daniel Cryans 

Jean-Daniel works as a Software Engineer at Cloudera 
on the Storage team, where he works on making 
Apache HBase better. Previous to that, he worked at 
StumbleUpon where he also worked on HBase while 
maintaining its production deployment. Jean-Daniel enjoys 
teaching HBase to newcomers and old-timers alike in the open- 
source community, or by giving presentations at Big Data and 
Apache Hadoop-related conferences and meetups. He became a 
committer and PMC member on HBase in 2008 when he was still 
an undergrad student at ETS Montreal. 



John A. De Goes 

John is CEO and CTO of Precog, and is responsible for 
leading the design and development of the company’s 
data-warehousing and analysis platform. He has been 
working professionally in distributed systems design 
and development for more than a decade. 

Author of multiple best-selling technical books, and a major 
contributor to open source, John has an extensive background in 
scientific and distributed computing, and in large-scale analytics. 
John is a frequent and well-received speaker at industry events. 
Recent engagements include DataWeek Conference, Glue Confer¬ 
ence, Frontier Developers, and NEScala. 



Max De Marzi 

Max is a Software Field Engineer at Neo Technology, 
where he built Neography Ruby Gem, a rest API wrap¬ 
per to the Neo4j Graph Database. He is addicted to 
learning new things, and he loves a challenge and 
finding (and sharing) pragmatic solutions. 



Jim Duey 

Jim is a Programmer at Lonocloud, and has been a 
professional programmer for more than 20 years (the 
last three at Clojure). He’s done work with embedded 
systems in multiple industries and languages like 
Forth, Delphi and C++. Jim’s blog on Clojure programming is at 
clojure.net. 
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Daniel Eklund 

Daniel is Principal Consultant for Think Big Analytics, 
and a software architect and technologist with over 15 
years of experience in enterprise software develop¬ 
ment. Daniel works with leading Fortune 500 customers to imple¬ 
ment Big Data/NoSQL storage and compute systems using 
Hadoop, Hive, Pig, Cascading, Cassandra and HBase. 

David Gleich 

David is an Assistant Professor of Computer Science at 
Purdue University, and is interested in how we can uti¬ 
lize matrix algebra to express — and improve — algo¬ 
rithms in network analysis and data-based simulation 
analysis. David's research straddles a few different areas and often 
involves working with large datasets on high-performance com¬ 
puting architectures like MPI clusters, and data computing archi¬ 
tectures, such as Map/Reduce. 
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Russell Jurney 

Russell is a Hadoop Evangelist at Hortonworks, but 
first cut his data teeth in casino gaming, building Web 
apps to analyze the performance of slot machines in 
the US and Mexico. After dabbling in entrepreneur- 
ship, interactive media and journalism, he moved to Silicon Val¬ 
ley to build analytics applications at scale at Ning and Linkedln. 
He lives on a cliff above the ocean in Pacifica, California with his 
wife Kate and two fuzzy dogs. 



Amandeep Khurana 

Amandeep is a Solutions Architect at Cloudera, where 
he is involved in building out solutions using compo¬ 
nents in the Hadoop ecosystem. He is also a co-author 
of the “HBase in Action" book. Prior to joining Cloud- 
era, Amandeep was at Amazon Web Services and was a part of the 
Elastic Map/Reduce team. 



0 Mark Grover 

Mark is a Software Engineer at Cloudera and a con¬ 
tributor to the Apache Hive open-source project. He is 
also a section author of O'Reilly's book on Apache 
Hive called “Programming Hive." Mark is an active re¬ 
spondent on the Hive mailing list and IRC channel. 


David Hwang 

David is the Founder and Chief Scientist of Restaurant 
y* Sciences. In his current role, he and his research team 
analyze data from thousands of restaurants across 
North America, and deliver insights back to industry 
suppliers on America's eating and drinking patterns. Mongo, R, 
Mahout, Python, D3 and ZFS all comprise the technology base of 
his current project. He is a true testament for what can be accom¬ 
plished with sub-normal intelligence, open-source software and a 
penchant for hard work. 


Dave Jespersen 

Dave brings his deep engineering experience to his 
role of chief customer advocate at MapR Technolo¬ 
gies. He enriches the customer experience by working 
with MapR's customer base to develop and imple¬ 
ment innovative solutions to the complex problems faced by 
every enterprise. 

He was previously VP of Engineering at MapR, where he led the 
development of MapR's industry-leading products. Dave has 30 
years of successful enterprise software development experience 
in both small and large companies, including EMC, Sun Mi¬ 
crosystems, Sterling Software, Spectra Logic, Exabyte and DEC. 
Dave was educated at Brigham Young University, where he earned 
a BS M.E. and a minor in Computer Science. 



Adam Laiacano 

Adam is a Data Scientist and Engineer at Tumblr, a 
blogging network with over 90 million blogs, where 
he's responsible for collecting and analyzing large vol¬ 
umes of data to gain a better understanding of trends 
and activity within the Tumblr community. He holds a Bachelor of 
Science degree in Electrical Engineering from Northeastern Uni¬ 
versity, and designed signal detection systems for low-power 
atomic clocks before joining Tumblr. 

Lynn Langit 

Lynn is an independent consultant who specializes in 
in database technologies (SQL and NoSQL, both on 
premise and cloud-based). Lynn has published three 
books on SQL Server Business Intelligence, and has 
created a set of courseware to introduce children to programming 
at www.TeachingKidsProgramming.org. Lynn has been recog¬ 
nized by Google and lOGen for technical community contribu¬ 
tions. Google awarded her the Google Developer Expert award, 
and lOGen added her to the list of MongoDB Masters. Since leav¬ 
ing a four-year stint at Microsoft in October 2011, Lynn has been 
working on building Big Data and BI solutions with customers 
from the education, manufacturing and hospitality sectors. She 
has authored courseware on SQL Server 2012 for DevelopMentor. 
Read her blog at www.LynnLangit.com. 

Boris Lublinsky 

Boris is a principal architect with Nokia, where he is actively par¬ 
ticipating in Big Data, SOA, BPM and middleware implementa¬ 
tions. Prior to this, Boris was a principal architect at Herzum 
Software, where he designed large-scale SOA systems for clients; 
and an enterprise architect at CNA Insurance, where he was in¬ 
volved in designing and implementing CNA's integration strategy, 
building application frameworks and implementing service-ori¬ 
ented architectures. 

Boris has more than 25 years' experience in enterprise and 
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technical architecture, and software engineering. He has more 
than 80 technical publications in different magazines, including 
Distributed Computing, Nuclear Instruments and Methods, Java 
Developer’s Journal, XML Journal, Web Services Journal, and EAI 
Journal. Boris is also a co-author of “Applied SOA: Service-Ori¬ 
ented Architecture and Design Strategies” and an SOA news edi¬ 
tor at Infoq.com. 

Jock Mackinlay 

I Jock is Tableau Software’s Senior Director of Visual 
I Analysis. At Stanford University, he pioneered the au- 
tomatic design of graphical presentations of relational 
information. He joined Xerox PARC in 1986, where he 
collaborated with the User Interface Research Group to develop 
many novel applications of computer graphics for information 
access, coining the term “Information Visualization.” Much of the 
fruits of this research can be seen in his book, “Readings in Infor¬ 
mation Visualization: Using Vision to Think.” Jock has a Ph.D. in 
computer science from Stanford University. 

Dan Mallinger 

Dan is the Data Science Team Lead at Think Big Ana¬ 
lytics. His experience in Big Data and Hadoop has run 
the gamut of customer data, hardware networking 
data, social data, gaming data, and more. Dan’s work 
in these verticals includes research, analysis and parallelization of 
techniques. In prior lives, he has worked as a statistician in a vari¬ 
ety of settings—including research in K-12 education—has expe¬ 
rience architecting systems, and was briefly a high school math 
teacher. 

Simon Metson 

Simon is an Engineer at Cloudant, and also a particle 
physicist. For the past 10 years, he worked on the dis¬ 
tributed computing system for one of the Large 
Hadron Collider experiments, aggregating petabytes of relational 
and non-relational data sources. In 2012, Simon joined Cloudant, 
where he designs software and creates educational material to 
help developers interact directly with massive datasets. 




: Michael Miller 

► 

v Mike is Chief Scientist at Cloudant, where he develops 

and evangelizes the company’s technical vision and 
manages long-term product R&D. While at MIT as a 
■ Postdoctoral Fellow, he cofounded Cloudant after cut¬ 
ting his teeth on petabyte-per-second problems at the Large 
Hadron Collider. Mike holds a B.S. in Physics and a B.A in Philoso¬ 
phy from Michigan State University, a Ph.D. in Physics from Yale 
University, and is an Affiliate Professor of Particle Physics at the 
University of Washington. He has more than a decade’s experi¬ 
ence as a builder of the most extreme Big Data systems on earth, 
as well as extensive experience lecturing on mathematics, 
physics, data science, and philosophy at the graduate and under¬ 
graduate level. 
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Andrew C. Oliver 

__ Andrew is the president of Open Software Integra- 

fl tors, a US firm specializing in NoSQL/Big Data devel¬ 
op! ^ opment with offices in Chicago and Durham, N.C. 
v wlf- Andrew is the “Strategic Developer” columnist for 
Infoworld, which focuses primarily on application development, 
largely using cloud and NoSQL technologies. He is also a well- 
known open-source advocate (having served on the board of the 
Open Source Initiative), was the founder of the POI project, and 
worked for successful startup JBoss before it was acquired by 
Red Hat. 



Claudia Perlich 

Claudia is Chief Scientist at Media6Degrees, a startup 
that specializes at targeted online display advertising. 
Claudia received her Ph.D. in Information Systems 
from the Stern School of Business at New York Univer¬ 
sity in 2005, and holds additional graduate degrees in Computer 
Science. Claudia joined the Data Analytics Research group at the 
IBM T.J. Watson Research Center in 2004, and continued her re¬ 
search on data analytics and machine learning for complex real- 
world domains and applications. 

She is the author or 50+ scientific publications; holds multiple 
patents in the area of machine learning; has won various data- 
mining competitions and best paper awards; and speaks regularly 
at conferences and other public events. In addition, she is teach¬ 
ing courses on Data Mining and Business Analytics at Stern, and 
gives guest lectures at Wharton, Columbia and MIT. 


Krishnan Raman 

Krishnan is a data scientist at Twitter. He was formerly 
a risk quant at Bank of America, an associate at Gold¬ 
man Sachs, and an engineer at Sun Microsystems. His 
experience in building the real-time proprietary trad¬ 
ing system WebET at Goldman Sachs, and concurrent Scala sys¬ 
tems to compute the conditional value at risk of large credit 
portfolios at BAC, have put him in good stead at the Revenue 
Quality team at Twitter. His primary tools are Scala, Scalding and 
a dash of statistics and math. He has graduate degrees in math, 
computer science and mathematical finance from the University 
of Chicago. 



Jairam Ranganathan 

Jai is the Director of Product Strategy at Cloudera 
where he is responsible for planning the future 
roadmap of Cloudera products. Before Cloudera, he 
spent a decade at VMware, where amongst other 
things he was one of the developers on vMotion, storage vMotion 
and the distributed management framework for vSphere. 
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Jason Rutherglen 

Jason is a senior architect at Think Big Analytics. He is co-author¬ 
ing the upcoming “Definitive Guide on Lucene and Solr” as well 
as “Programming Hive.” He has given several talks at conventions, 
including Strata and Cassandra Summit. He co-presented a tuto¬ 
rial on distributed search at Strata New York. He has taught the 
Map/Reduce/Hive/Pig Think Big Academy class through Think 
Big Academy. 

Stefano Santoro 

Stefano is a staff software engineer for high-velocity 
database vendor VoltDB. He is an industry veteran 
who started his career working with VAX Clusters, and 
he went on to work with large telco back ends prior to 
joining VoltDB. 

Eddie Satterly 

K Jl Eddie is Chief Big Data Evangelist at Splunk and has 
] served in a variety of roles, including developer, engi- 
neer, architect and CTO over his 23 year career. He has 
been a long-time Big Data user, even before it was the 
cool thing to do. More recently, he was able to revolutionize the way 
a leading online travel agency delivers their core Web applications 
that resulted in improved user experience. He created a highly scal¬ 
able and flexible Big Data environment using best-in-breed tools, 
and as a result, was able to retire 35 other systems. Eddie has done 
guest lectures at universities, and presents at several conferences 
and symposiums yearly. He is a recognized expert in the field of Big 
Data and has presented at many global conferences on the topic. 
Eddie has a B.S. in Computer Science from Indiana University. 

Michael Segel 

Michael is a principal consultant with Think Big Analytics. As a 
principal, he is involved in working with clients, assisting with 
their strategy and implementation of Hadoop. Michael is also in¬ 
volved as an instructor with Think Big’s Academy, teaching courses 
on Hadoop Development in Java, Hive and Pig, along with HBase. 

Prior to joining Think Big, Michael ran his own consulting firm, 
developing solutions for customers around the Chicago area. 

Since 2009, Michael has been working primarily in the Big Data 
Space. He also founded the Chicago Hadoop User Group (CHUG). 

Michael received his bachelor’s degree in Computer Science 
from the College of Engineering at Ohio State University. 

Jonathan Seidman 

Jonathan is a Solutions Architect on the Partner Engi¬ 
neering team at Cloudera. Before joining Cloudera, he 
was a Lead Engineer on the Big Data team at Orbitz 
Worldwide, helping to build out the Hadoop clusters 
supporting the data storage and analysis needs of one of the most 
heavily trafficked sites on the Internet. Jonathan is also a founder 
and organizer of the Chicago Hadoop User Group and the Chicago 
Big Data Meetup, and a frequent speaker on Hadoop and Big Data 
at industry conferences such as Hadoop World, Strata and OSCON. 
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Dr. Michael Stonebraker 

Michael has been a pioneer of data base research and 
technology for more than a quarter of a century. He is 
widely recognized as one of the world’s foremost ex¬ 
perts in database technology and is noted for his in¬ 
sight in operating systems and expert systems. 

Over his career, Mike has been both a professor and leading ar¬ 
chitect for prototype development. He was the main architect of 
the INGRES relational DBMS, the object-relational DBMS, POST- 
GRES, and the federated data system, Mariposa. All three proto¬ 
types were developed at the University of California at Berkeley 
where he was a Professor of Computer Science for 25. He is the 
founder of three successful Silicon Valley startups, whose objective 
was to commercialize these prototypes. Additionally, Mike has au¬ 
thored scores of research papers on database technology, operat¬ 
ing systems and the architecture of system software services. 

Mike is presently an Adjunct Professor of Computer Science at 
M.I.T., where he is working on a variety of future-generation data- 
oriented projects. He is co-director of the new Intel Science and 
Technology Center at MIT CSAIL focusing on Big Data Analytics. 

Mike is also a co-founder of eight software startups, including 
Streambase, Vertica Systems, VoltDB, Goby, and Paradigm4. He 
was awarded the prestigious ACM System Software Award in 1992 
for his work on INGRES, the first annual Innovation award by the 
ACM SIGMOD special interest group in 1994, and the IEEE John 
Von Neumann award in 2005. Mike was elected to the National 
Academy of Engineering in 1997 and to the American Academy of 
Arts and Sciences in 2010. 



Ryan Tabora 

Ryan is a data developer at Think Big Analytics. He has 
hands-on experience with Fortune-500 clients work¬ 
ing on Big Data solutions, including technologies like 
Hadoop, HBase, Hive, Cassandra and Solr. He is co¬ 
authoring the upcoming “Definitive Guide on Lucene and Solr.” 
He co-presented a tutorial on distributed search at Strata New 
York. He helped develop the coursework for the Think Big Acad¬ 
emy HBase and Solr class. 

f Asish Thusoo 

Asish, CEO and co-founder of Qubole, a pioneering Big 
Data startup, is also the co-creator of Apache Hive and 
served as the project’s founding Vice President at the 
Apache Software Foundation. He started his career as 
an engineer at Oracle, where he contributed heavily to many core 
components of Oracle RDBMS. Ashish also ran the Data Infra¬ 
structure team at Facebook, leading the team in the creation of 
one of the largest data-processing and analytics platforms in the 
world, a platform that achieved the bold aim of making data ac¬ 
cessible to analysts, engineers and data scientists alike within the 
company. Ashish has a Bachelor’s degree in CS from IIT-Delhi, and 
a Master’s degree in CS from University ofWisconsin-Madison. 
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Kathleen Ting 

Kathleen is a Support Manager at Cloudera, and is a 
committer on the Apache Sqoop project, and has spo¬ 
ken at many Big Data conferences, such as Hadoop 
World on Map/Reduce; at HBaseCon on HBase; at 
Strange Loop on ZooKeeper; and at Hadoop Summit on Sqoop. 

Lisa Vaas 

Lisa has been writing about technology, careers, 
science and health since 1995. She rose to the lofty 
heights of Executive Editor for eWEEK, popped out 
with the 2008 crash, joined the freelancer economy, 
and now writes for venues such as Sophos's Naked Security blog, 
CIO Mag, ComputerWorld, PC Mag, IT Expert Voice, Software 
Quality Connection, Time, PBS’ Next Avenue site, Forbes, and the 
US and British editions of HP's Input/Output. 

Ravi Veeramachaneni 

Ravi is Principal Architect at RSA, a security division of 

EMC, where he architects and implements Big Data 
||B solutions for the Archer platform. Ravi is also an En- 

^ terprise Architect and Technology leader with 20+ 
years of software design, development and architecture experi¬ 
ence, specializing in e-business, service-oriented architecture, 
business-to-business integration, large-scale system design, 
cloud computing and Big Data. His experience spans a broad 
range of industries, including banking, insurance, supply chain, 
B2B e-commerce, geo-spatial, and retail. He has played various 
technology leadership roles at Discover Financial, Key Corp., 
Navteq, Nokia and Informatica, to name few. 

He has spoken at various industry events for the last couple of 
years, including Chicago Data Summit, Hadoop World, CAMP IT 
conference, Enterprise Data World, Informatica World, and Infor¬ 
matica User Groups. Ravi also frequently speaks at Navteq inter¬ 
nal TechTalks on a variety of technical topics. 

Dean Wampler 

Dean is Principal Consultant at Think Big Analytics, 
specialists in Big Data, particularly in using the 
Hadoop ecosystem of tools. He speaks frequently at 
conferences on various Big Data and other program¬ 
ming topics. Dean is the co-author of “Programming Hive,” the 
author of “Functional Programming for Java Developers,” and the 
co-author of “Programming Scala.” 
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Hotel & Travel 

Big Data TechCon will be held at the Hyatt Regency Cambridge. 
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Hyatt Regency Cambridge, Overlooking Boston 
575 Memorial Drive 
Cambridge, MA 02139-4896 
Phone:+1-617-492-1234 
Fax:+1-617-491-6906 
www.cambridge.hyatt.com 

Reservations 

Special Discounted Rates 

Room rates for Big Data TechCon 
attendees are US$199 per night for 
single/double occupancy. 

Rooms for the reduced rate are limited and are available on a 
first-come, first-serve basis. 

Click here to make your hotel reservation 

(or use the “Make Hotel Reservation” option on 
the confirmation page of your registration). 

Reservations at the reduced rate can be made through 5:00 PM 
Eastern time on March 18, 2013—assuming they don't sell out. 
The number of rooms in the discounted block is LIMITED, and 
historically rooms sell out well before the deadline. Don't wait 
until the last minute to reserve your hotel rooms! 

This rate is available throughout Big Data TechCon. Those who 
reserve their hotel rooms via this reservation link (our room 
block) will receive: 

• Complimentary wireless Internet service in their rooms. 

• Overnight self-parking discounted to $25 per day, and daily 
self-parking discounted to $15 per day. 

Hotel Highlights 

Minutes from Boston, the Hyatt Regency Cambridge hotel is 
located along the scenic Charles River overlooking the Boston 
skyline, and is in the midst of two uncommonly exciting cities, 
Boston and Cambridge. Each exhibits a unique blend of 
old-world charm coupled with youthful, contemporary 
sophistication. Cambridge is the spirit side of Boston, just 
a bridge away on the historic side. 




Driving Directions 

From the West 

From Mass Pike: Take exit 18 - Allston/Cambridge (left-hand 
exit). Follow the signs for Cambridge. Cross the River Street 
Bridge. Exit right at the end of the bridge onto Memorial Drive. 
Hyatt is half a mile up on Memorial Drive (Route 3) on the left- 
hand side. Turn left at the traffic light to access the hotel entrance 
and parking garage. 

From the South 

From 1-93: Take exit 26 - Storrow Drive/Back Bay/Cambridge. 

Stay in the right lane (Storrow Drive). Go 3/4 of a mile and take 
the second exit on the left, Government Center/Kendall Square. 
Go up the ramp and stay in the right lane. Turn right at the stop 
sign. Go across the Longfellow Bridge. Take the first left off the 
bridge and turn onto Memorial Drive (Route 3). Stay on Memorial 
Drive for approximately 1 mile on the right. At the light, turn 
right. Hyatt is on your left. 

From the North 

From 1-93 or Route 1: Take exit 26 - Storrow Drive/Back Bay/ 
Cambridge and follow “From the South” directions. 

From Logan International Airport 

Follow signs to the Mass Turnpike: 90 Boston/Williams Tunnel. 
Pay toll upon exiting airport. You will be heading west on 90. 

From 90 West, take exit 20 - Brighton/Cambridge (pay another 
toll). Out of the tollbooth, bear right following the sign for the 
Cambridge/Somerville exit. From this exit ramp, stay straight 
through two lights and cross over the bridge. After the bridge, 
turn right at the light onto Memorial Drive. Stay in the left lane. 
Continue on the overpass in the left lane. At the first light on 
Memorial Drive, make a left and then another quick left into the 
Hyatt front circle. 

Current parking rates 

Self-Parking: $35 (plus 14% tax) per day (24 hours) 

Valet Parking: $42 (plus 14% tax) per day (24 hours) 
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Getting Approval 

Gotta Get Approval? Try These 11 Time-Tested Tactics: 

1. STUDY. Note the HOW-TO classes and workshops at Big Data I 
TechCon focused on the latest Big Data technologies, especially 
those that are best aligned with your company’s existing IT infra¬ 
structure. Say that this is your first, and most practical, opportu- 
nity to bring Big Data to your business. 

2. PREPARE. Download the course catalog and circle the classes 
you want to take, and explain why the topics relate to your Big Data 
technical efforts. Show that you have found many sessions that fit 
your specific needs, and your company’s strategic goals. 

i 

3. JUSTIFY. Go in armed with all the necessary materials to 
make a good case for how your attending Big Data TechCon will 
help your company make money, save money or improve produc¬ 
tivity by helping you capture and analyze the data that drives your 
business. 

i 

4. SHARE. Promise to come back from Big Data TechCon and 
hold a brown-bag lunch session to share what you’ve learned 
with your colleagues, or even conduct formal training within your 
department. In fact, maybe you’ll want to schedule a series of 
brown-bag lunches. 

5. PLAN ■ Tell management that after you attend Big Data Tech¬ 
Con, you'll make definite action plans and recommendations to 
implement new Big Data plans, and to improve how your com¬ 
pany uses all of the data it captures. 


Special Group 



6- RELATE. Show how problems or issues you’ve recently en¬ 
countered fit with the classes at Big Data TechCon, and discuss 
the types of technology discussions you'll have with the confer¬ 
ence faculty and other IT professionals. 

7. SAVE. The tuition and travel expense of attending Big Data 
TechCon is less than many other conferences. The earlier you sign 
up, the more you save, so explain the benefit of signing up early, 
and for booking your hotel room before the cutoff. 

8. TEAM . Save even more with group discounts. Send three or 
more employees from your company, and save $100 off per per¬ 
son. Each person can take different classes and bring back even 
more valuable tips and techniques. (Sending 10 or more? Contact 
us for arrangements.) 

9. GROUP. User groups, government employees, non-profits 
and professionals employed by or attending educational institu¬ 
tions can also receive special savings. Check the website or ask 
Stacy Burris (sburris@bzmedia.com) about custom options for 
your group. 

10. LAUNCH . Classes at Big Data TechCon help you get a jump- 
start on every aspect of Big Data that you have been talking about 
implementing (but haven’t) for months. Whether it’s Hadoop, 
graph databases, NoSQL or another new technology, explain that 
you’ll find the answers here. 

11. DECIDE . While you can sign up anytime, your company 
will save the most if you beat the deadlines. Explain that you 
will help your company’s bottom line by signing up for 

Big Data TechCon today! 








012913 
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Register by 

Jan. 11 

Register by 

Feb.22 

Register by 

March 22 

After 

March 22 

Three-Day Conference 

$1,195 

$1,295 

$1,495 

April 8-10 

SAVE $400 

SAVE $300 

SAVE $200 


Exhibit Hall Only 

FREE 

FREE 

FREE 

$50 

April 9-10 





Register Online TODAY at www.BigDataTechCon.com! 


Three-Day Conference 

Registration Includes: 

• Admission to workshops and 
technical classes on April 8, 9 and 10 

• Admission to keynotes 

• Admission to the Exhibit Hall 

• Admission to all special events, 
including the Networking Reception 

• Downloadable conference materials 

• Coffee breaks and lunch 
where indicated 

Exhibit Hall Only 

Registration Includes: 

• Admission to the Exhibit Hall 

• Admission to Networking Reception 


Big Data gets Real 
at Big Data TechCon! 



How to Register 

Register online and use one of the 
following payment methods: 

Credit Card. You can use the secure 
online form to pay via credit card and 
get immediate confirmation of your regis¬ 
tration. MasterCard, Visa and American 
Express are accepted. You'll receive a 
registration record and receipt. Please 
print out these pages and bring them with 
you to the Conference. Present them at the 
Registration Desk to pick up your badge 
and course materials. 

Check. Fill out the online registration 
form. Print out the registration record and 
receipt and mail them to BZ Media LLC, 

7 High Street, Suite 407, Huntington, NY 
11743, with your payment. Online registra¬ 
tions that are mailed without payment will 
not be confirmed until payment is received. 

Purchase Order. If you register using 
a P.O., you'll be invoiced immediately for 
the registration amount. Payment must 
be received before your registration can be 
confirmed. 

Special Discounts 

You may combine one of these special 
discounts with the Early Registration 
pricing to save even more! 

Group. Group discounts will be given 
automatically if you register three or more 
people at once. You can also contact Stacy 
Burris at sburris@bzmedia.com to receive 
the $100/person discount if your group 
is unable to register at the same time. 
Contact her also for special discounts for 
groups of 10 or more. 

Government Employees. Federal, State 
and Local Government employees can 
receive an additional $100 off a r FNTRA , 
the Three-Day Conference ^ rkYst^aIiox 


price. Enter code GOV in the discount 
code field. CCR-registered indicates that 
we are listed in the primary supplier data¬ 
base for the Federal Government. 

Educational Institutions. Personnel 
employed by or attending educational 
institutions can get a $100 discount off the 
Three-Day Conference price by using the 
code EDU. 

User Groups. Contact Stacy Burris at 
sburris@bzmedia.com to see if your group 
is eligible for a discount. 

Non-Profit Organizations. Personnel 
employed by non-profit organizations can 
get a $100 discount off the Three-Day 
Conference price by using the code 
NONPROFIT. 

Cancellation and Refund Policy 

You can receive a full refund, less 
a $150 registration fee, for cancellations 
made by Friday, Feb. 22, 2013. Cancella¬ 
tions after this date are non-refundable. 
Send your cancellation in writing to 
registration@bzmedia.com. Registrations 
may be transferred to another person. 

Refunds will be processed through the 
same method of payment as the initial 
payment transaction. Credit-card refunds 
will be processed to the same credit card 
as the original payment. 

If for reasons beyond our control 
the conference cannot take place as 
scheduled, BZ Media reserves the right to 
reschedule the conference to a date and 
place of it's choosing. 

Questions 

Contact Stacy Burris, Event Director, at 
sburris@bzmedia.com or 
+1-631-421-4158 x!08. 











